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(54) Tide: NOVEL CHIMPANZEE ERYTHROPOIETIN (CHEPO) POLYPEPTIDES AND NUCXEIC ACIDS ENCODING THE SAME 

human MGVHECPAWLWLLIiSLLSLPLGLPVLGAPPRLICDSRVLERYLLEAKEAE . 

************************************************** 

chepo MGVHBCPAHLWLLLSLLSLPLGLPVLGAPPRLICDSRVLERYLLEAKEAE 
human NITTGO^CSLNENlTVPDTKVKFYAWKRriBVGQQAVEW 
chepo NITTGCAEKCSLNENXTVPOTKVNFYAWKRMEVRQQAyEVWQGL^ 



human VLRGQALLVNSSQPWBPLQLKVBKAVSGLRSLTTLLRALGAQKEAISPPD 
***************************************** ******** 

chepo VLRGQALLVNSSQFHEPIiQUIVDKAVSGLRSLTTIiLRALGAXKEAISPP 



human AASAAPLRTITADTFRKLFRVYSNFliRGXLKLYTGEACRTGDR 
******************************************* 

chepo AASAAPLRTITADTFRKLFRVYSNFLRGKLKLYTGEACRTGDR 

(57) Abstract 

The present invention is duiected to novel chimpanzee erythropoietin polypeptides and to nucleic acid molecules encoding those 
polyp(^tides. Also provided herein are vectors and host cells comprising those nucleic acid sequences, chimeric polypeptide molecules 
comprising the polypeptides of die present invention fused to heterologous polypeptide sequences, antibodies which bind to die polypeptides 
of die present invention and to mediods for pioducing the poIypq>tides.of the present invention. 
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NOVEL CHIMPANZEE ERYTHROPOiniN (CHEPO) POLYPEPTIDES AND NUCLEIC ACIDS ENCODING THE 

SAME 



RELD OF THE INVENTION 

5 The present invention relates generally to the identification and isolation of novel chimpanzee erythropoietin 

polypeptides, nudeic acid molecules encoding those polypeptides and to the recombinant production of those polypeptides. 



BACKGROUND OF THE INVENTION 
Erythropoiesis, the production of red blood cells, occurs continuously throughout the human life span to offset 
1 0 cell destruction. Erythropoiesis is a very precisely controlled physiological mechanism oiabfing sufficient numbers of red 
blood cells to be available in the blood for proper tissue oxygenation but not so many that the cdls would impede 
circulation. The formation of red blood cells occurs in the bone marrow and is under control of the hormone, erythropoietin. 

Erythropoietin, an acidic glycoprotein is spproximately 34,000 dalton molecular weight may occur in three forms: 
alpha, beta and asialo. The alpha and beta forms different slightly in carbohydrate components have the same potency, 
15 biologicet activity and molecular weight. The asialo form is an alpha or bete form with the terminal carbohydrate (sialic 
add) removed. Erythropoietin is present in a very low concentrations in plasma when the body is in a healthy state wherein 
tissues recehre suf fident oxygenation from the existing number of erythrocytes. This normal low concentration is enough 
to stimulate replacement of red blood cells which are lost normally through aging. 

The amount of erythropoietin m the drculation is increased under conditions of hypoxia when oxygen transport 
20 by blood ceils in the cvculation is reduce! Hypoxia may be caused by loss of large amounts of blood through hemorrhage, 
destruction of red blood cells fay over-exposure to radiation, reduction m oxygen intake due to high altitudes or prolonged 
unconsdousness, or various forms of anemia. In response to tissues undergoing hypoxic stress, erythropoietin will increase 
red blood ceil production by stimulating the conversion of primitive precursor cells in the bone marrow into proerythroblasts 
which subsequently mature, synthesize hemoglobin and are released into the drculation as red blood cells. When the 
25 number of red blood eels in circulation is greater than needed for normal tissue oxygen requirements, erythropoietin in 
cteulation is decreased. 

Because erythropoietin is essential m tiie process of red blood cell formation, tiie hormone has potential useful 
application in both tiie diagnosis and treatment of blood iGsoriters characterized by low or defective red blood cell 
production. See, generdy, Pemudiur-Das, et aL, iood 63(5):1 168-71 (1984) and Haddy, Am.Jour.Ped.Hematol.DncoL 
30 4:19M96 (1982) relating to eiytiirepoietin ui pessBde tiierapies for siclde eel disease, ami Esddiadi et al, J. Clin. Invest. 
74(2]:434-441 ( 1 984), ihscribii^ a timpeutic legimoi for uremic sheep based aniamo response to erytiiropowtin-rich 
plasma infusions and proposing a dosage of 10 U EOP/kg per day for 1540 days as corrective of anemia of die type 
assodated witii dironic renal failure. See also, Krane/Hannr Ford Hosn, Med. J„ 31(3):177.181 f1983). 

We describe herein the identifkation and characterization of a novel erytiiropoietm polypeptide iterhred from the 
3 5 chnnpanzee, designated hereni as CHEPO . 
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SUMMARY OF TH£ IWVEWTinw 
A cONA cione has been identified that has homology to nucleic acid encoding human erythropoietin that encodes 
a novel chimpanzee erythropoietin polypeptide, designated in the present application as "CHEPO". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising a nucleotide sequence 
5 that encodes a CHEPO polypeptide. 

in one aspect the isolated nucleic acid molecule comprises a nucleotide sequence having at least about 80% 
nucleic acid sequence identity, alternatively at least about 81% nucleic acid sequence identity, alternatively at least about 
82% nucleic acid sequence identity, altematrady at bast about 83% nucleic acid sequoice identity, altematively at least 
about 84% nucleic acid sequence identity, altematively at least about 85% nucleic add sequence identity, alternatively 
10 at least about 86% nucleic acid sequence identity, altemathraly at least about 87% nucleic acid sequence identity, 
altematively at least about 88% nucMc acid sequence identity, altematively at least about 89% nucleic acid sequence 
identity, alternatively at least about 90% nucleic acid sequence identity, altematively at least about 91% nucleic add 
soiuence identity, altematively at least about 92% nucleic add sequence identity, altematively at least about 93% nucleic 
acid sequence identity, altematively at bast about 94% nudeic add sequence identity, attematively at least about 95% 
1 5 nucleic add sequence identity, altematively at least about 96% nudeic add sequence identity, alternatively at least about 
97% nudeic acid sequence identity, altematively at least about 98% nucleic add sequence identity and altematively at 
least about 99% nucleic add sequence identity to (a) a DNA molecule encoding a CHEPO polypeptide having tiia sequence 
of amino acid residues from about 1 or about 28 to about 193, indusive, of Figure 3 ISEQ ID N0S:2 and 5), or (b) the 
complement of the DNA molecule of (a). 
20 In anotiier aspect tiie isolated nudeic add molecule comprises (a) a nucleotide sequence encoding a CHEPO 

polypeptide having tiie sequence of ammo add residues from about 1 or about 28 to Aout 193, indu^ve, of Figure 3 (SEQ 
ID N0S:2 and 5), or (b| the complement of tiie nudeotide sequence of (a). 

In other aspects, tiie isolated nuddc add molecule comprises a nucleotide sequence having at least about 80% 
nucleic acid sequence identity, altemathrely at least about 81 % nucleic add soiuence identity, altematively at least about 

2 5 82% nucleic add sequence identity, ahematively at least about 83% nudeic acid sequence identity, altematively at least 

about 84% nudeic add sequence identity, attematively at least about 85% nudeic add sequence identity, attemathrely 
at least about 86% nuddc add sequence identity, altematively at least about 87% middc add sequence identity, 
altematively at least abmit 88% mideic add sequence identity, altematively at least about 89% nudeic add s«]uence 
identity, altematively at least about 90% nudeic acU sequence identity, attematively at least about 91 % nudeic add 

3 0 sequence identity, altematively at least about 92% nuddc acU sequence identity, altematively at least about 93% nudeic 

add sequence idoitity, ahematively at bast about 94% nucUc add sequence identity, dtematively at least about 95% 
nuddc add seipience identity, altenmtivdy at bast about 86% nudaic add sequence identity, dtematively at bast about 
97% nuddc add sequence identity, attematively at bast about 98% nudeic acid sequence identity and attematively at 
least diout 99% mideic add sequence identity to (e) a DNA motacule having ti» sequem:e of micbotides from about 1 or 
3 5 about 82 to diout 579, inclushre. of Figure 2 (SEQ ID N0:3), or (b) tfie compbment of tiie DNA mobcub of (aj. 
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In another aspect the isolated nucleic add molecule comprises (a) the nucleotide sequence of from about 1 or 
about 82 to about 579, inclusive, of Figure 2 (SEQ ID N0:3), or (b) the complement of the nucleotide sequence of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule which encodes an active CHEPO 
polypeptide as defined below comprising a nucleotide sequence that hybridizes to the complement of a nucleic acid 
5 sequence that encodes amino adds 1 or about 28 to about 193, inchishre, of Figure 3 (SEQ ID N0S:2 and 5). Preferably, 
hybridization occurs under stringent hybridization and wash conditions. 

In yet another aspect the mvention concerns an isolated nucleic add molecule which encodes an active CHEPO 
polypeptide as defined below comprising a nucleotide sequence that hybridizes to the complement of the nucleic acid 
sequence between about nucleotides 1 or about 82 and about 579, indushre, of Hgure 2 (SEQ ID N0:3). Preferably, 

1 0 hybridization occurs under stringent hybridization and wash conditions. 

In a f urtiier aspect tiie invention concerns an isolated nudeic add molecule which is produced by hybridizing a 
test DNA molecule under stringent conditions witii (a) a DNA molecule encoding a CHEPO polypeptide having tiie sequent 
of amino acid residues from about 1 or about 28 to about 193, Indushre, of Rgure 3 (SEQ ID N0S:2 and 5), or (b) tiie 
complement of the DNA molecule of (a), and, if tiie test DNA molecule has at least about an 80% nucleic add sequence 

15 identity, alternatively at least about 81 % nucleic acid s^uence identity, alternatively at least about 82% nucleic add 
sequence identity, alternatively at least about 83% nudeic add sequence identity, alternatively at least about 84% nudeic 
acid sequence identity, alternatively at least about 85% nucleic add sequence identity, alternatively at least about 86% 
nudeic acid sequence identity, alternatively at least about 87% nudeic add sequence identity, alternatively at least about 
88% nudeic add sequence identity, alternatively at least about 89% nuddc add sequence identity, alternatively at least 

2 0 about 90% nudeic acid sequence identity, alternatively at least about 91 % nuddc acid sequence identity, alternatively 
at least about 92% nuddc add sequence identity, alternatively at least about 93% nuddc add sequence identity, 
alternatwely at least about 94% nuddc acid sequence identity, alternatively at least about 95% nucleic add sequence 
identity, alternatively at least about 96% nudeic add sequence identity, alternatively at least about 97% nucleic add 
sequence identity, alternatively at least about 98% nuddc add sequence identity and alternatively at least about 99% 

25 mndeic add sequence identity to (a) or (b), and isolating tiie test DNA molecule. 

In anotiwr aspect tiie invention concerns an isolated nuddc acid molecule comprising (a) a nudeotide sequence 
encoding a pdypeptide scoring at least about 80% podtives, altemetively at least about 81% positives, altematively et 
least about 82% positives, alteroativdy at hast about 83% positives, altematively at least about 84% positives, 
dtematively et bast about 85% podtives, altematively at least about 86% positives, altematively at least about 87% 

30 positives, dternatively at least about 88% positives, ahernatiye^ 

90% podtives, altemativdy at least about 91 % podtivas, ahemativeiy at least about 92% podtives, ahemativdy at least 
about 93% podtives, altemativdy at least about 94% podthres, altemativdy at least about 95% podtives, dtemativdy 
at least about 96% podtives, ahemativdy at least abmit 97% podtives, dtemetively at bast diout 98% positives and 
altemativdy at bast about 99% podtives when compared witii the amino add sequence of redibes about 1 or about 28 

35 to 193, indudve, of Figure 3 (SEQ ID N0S.2 ami S), or (b) the complonent of tiie nucbotide sequence of (aL 
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in a specific aspect the invention provides an isolated nucleic acid molecule comprising DNA encoding a CHEPO 
polypeptide without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding 
nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 
to about amino add position 27 m the sequence of figure 3 (SEQ ID N0S:2 and 5). It is note! however, that the C terminai 
boundary of the signal peptide may vary, but most likely by no more than about 5 amino acids on either side of the signal 
peptide C-termtnal boundary as initially Uentified herein, where'm the C-terminal boundary of the signal peptide may be 
identified pursuant to criteria routinely employed m the art for identifying that type of amino add sequence element feg., 
Nielsen et at, PjSLM 10:1-6 (1997) and von Heinje et aU Nucl. Adds. Res, 14:46834690 (1986)). Moreover, it is also 
recognized that in some cases, cleavage of a signal sequence from a secreted polypeptide is not entirely uniform, resulting 
in more than one secreted species. These polypeptides, and the polynucleotides encoding them, are contemplated by the 
present invention. As such, for purposes of the present application, the signal peptide of the CHEPO polypeptide shown 
in Figure 3 (SEQ ID N0S:2 and 5) extends from amino acids 1 to X of Figure 3 (SEQ ID N0S:2 and 5), wherein X is any 
amino add from 23 to 32 of Figure 3 (SEQ ID N0S:2 and 5). Therefore, mature forms of the CHEPO polypeptide whkh 
are encompassed by the present invnition inchide those comprising amino adds X to 193 of Rgure 3 (SEQ 10 N0S:2 and 
5), wherein X is any amino acid from 23 to 32 of Figure 3 (SEQ ID N0S:2 and 5) and variants thereof as described below. 
Isolated nucleic add molecules encoding these polypeptides are also contemplated. 

Another embodiment is directed to fragments of a CHEPO polypeptide coding sequence that may find use as, for 
example, hybridization probes or for encoding fragments of a CHEPO polypeptide that may optionally encode a polypeptide 
comprising a binding site for an anti-CHEPO antibody. Such nudeic acid fragments are usually at least about 20 
nucleotides in length, altemathrely at least about 30 nucleotides in length, alternatively at least about 40 nudeotides in 
length, alternatively at least about 50 nudeotides In length, alternatively at least about 60 nudeotides in length, 
altemafively at least about 70 nudeotides in length, ahemathfoly at least about 80 nudeotides in length, alternatively at 
least about 90 nucleotides in length, ahematively at least about 100 nudeotides in length, altemathrely at least about 110 
nudeotides in length, altemathrely at least about 120 nucleotides in length, alternatively at least about 130 nucleotides 
in length, ahematively at least about 140 nucleotides in lengtii, altemathrely at least about 150 nudeotides in length, 
aitematively at least about 160 nudeotite in length, ahematively at least about 170 nucleotides in lengtii, altemativdy 
at least about 180 nudeotides in lengdi, ahematively at least about 190 nudeotides m length, alt^ativdy at least about 
200 nudeotides in lengtii, aitematively at bast about 250 nucleotides in lengtii, ahematively at least about 300 
nudeotides m lengtii, ahematively at bast about 350 nudratides in tangtii, aheroativdy at bast about 400 nudeotides 
in lengtii, ahemativdy at least about 450 nudeotides in bngtii ahematively at least about 500 nudeotides m length, 
Bhemativeiy at least about 600 mictaotides in tengflt ehematively at bast about 700 nudeotides in lengtii, altemativeiy 
at bast about 800 nucbotidas in bngflt altemativdy at bast about 900 nucleotides in lengtii and aitematively at bast 
about 1000 nudeotides in bn^ about means tin referenced nudeotide sequence tangtii 

ptas or mimis 10% of tiiat referenced lengtii In a preferred embodiment tfie nudeotide sequence fragment is derived from 
any coding region of tfie nudeotide seqi^ shown m Figure 1 (SEQ ID N0:1). It b noted that novel fragments of a 
CHEPO polypeptide^ncoiEng nudeotide sequence mey be determined in a routine manner by alqnmg tin CHEPO 
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poiypeptide-encoding nucleotide sequence with other known nucleotide sequences using any of a number of well known 
sequence aiignment programs and detemiining which CHEPO polypeptide-encoding nucleotide sequence f ragment(s) are 
novel All of such CHEPO polypeptide-encoding nucleotide sequences are contemplated herein and can be determined 
without undue experimentation. Also contemplated are the CHEPO polypeptide fragments encoded by these nucleotide 
molecule fragments, preferably those CHEPO polypeptide fragments that comprise a binding site for an ahti CHEPO 
antibody. 

In another embodiment the invention provides a vector comprising a nucleotide sequence encoding CHEPO or its 
variants. The vector may comprise any of the Isolated micleic acid molecules hminabove Identified. 

A host call comprising such a vector is also provided. By way of example, the host cells may be CHO cells, £ 
eoS, or yeast A process for producing CHEPO polypeptides is further provided and comimses cultur'mg host cells under 
conditions suitable for expression of CHEPO and recovering CHEPO from the cell culture. 

hi another emfaodmnt the inventhm provides isolated CHEPO polypeptide encoded by any of the isolated nucleic 
acid sequences hereinabove identified. 

In a specific aspect, the invention provfaies isolated native sequence CHEPO polypeptide, which in certain 
embodiments, inchides an amino acid sequence comprising residues from about 1 or about 28 to about 193 of Figure 3 
(SEQIO N0S:2and5). 

In another aspect the invention concerns an isolated CHEPO polypeptide, comprising an ammo acid sequence 
having at least about 80% amino acid sequence identity, alternatively at least about 81% amino acid sequence identity, 
altemathrely at least about 82% amino acid sequence identity, alternatively at least about 83% amino add sequence 
identity, alternatively at least about 84% amino acid sequence identity, alternatively at least about 85% amino acid 
sequence identity, ahemativeiy at least about 86% amino acid sequence identity, altemathrely at least about 87% amino 
acid sequence identity, alternatively at least about 88% amino acid sequence identity, alternatively at least about 89% 
amino acid sequence Mentity, alternatively at least about 90% amino acid sequence identity, alternatively at least about 
91% amino acid sequence identity, alternatively at least about 92% amino acid sequence identity, eltematively at least 
about 93% amino add sequence identity, altemathrely at least about 94% andno add sequence identity, eltematively at 
least about 95% amino add sequence identity, attematively at least about 96% amino add sequence identity, alternatively 
at least about 97% amino acid sequence identity, alternatively at least about 98% amino acid sequence Identity and 
alternatively at least about 99% amino add sequence identity to tin sequence of amino add residues from about 1 or about 
28 to about 193, mdushre. of Hgure 3 (SEQ ID N0S:2 and SI. 

In a furflier aspect the nivention concerns an isolated CHEPO polypeptide comprising an ammo add sequence 
scoring at least about 80% positives, ahematively at bast about 81 % positives, attematively at bast about 82% positives, 
altemativdy at tasot about. 83% positives^ ahematively at least about 84% posithres, altemathrely at bast about 85% 
positives, aheroativriy at bast about 86% positives, eltematively at bast about 87% positives, alternatively at bast dbout 
88% positives, dterrotively at bast about 89% podtive$, eltematively at bast about 90% positives, altemativdy at bast 
about 91% positives, dtematively at bast about 92% positives, alteraatiydy at bast about 93% positives, alternatively 
et bast about 94% positives, alternatively at bast about 95% positives, eltematively at leest about 96% positives. 
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attemativety at least about 97% positives, alternatively at least about 98% positives and alternatively at least about 99% 
positives when compared with the amino acid sequence of residues from about 1 or about 28 to about 193« inclusive, of 
Figure3(SEQIDN0S:2andS). 

In a specific aspect the imrembn provides an isolated CHEPO polypeptide without the N*tenninal signal sequence 
5 andlor the initiating methionine and is encoded by a nucleotide sequence that encodes such an amino acid sequence as 
hereinbefore described. Processes for producing the same ere also herein described, wherein those processes comprise 
Girfturing a host cdl comprising a vector which comprises the appropriate encoding nucleic acid molecule under conditions 
suitable for expression of the CHEPO polypeptide and recovering the CHEPO polypeptide from the ceD culture. 

b yet another aspect the invention concerns an isolated CHEPO polypeptide, comprising the seqimnce of amino 

1 0 acid residues from about 1 or about 28 to about 1 93, inchishre, of Rgure 3 (SEQ ID N0S:2 and S), or a fragment thereof 
which is biologically active or sufficient to provide a binding site for an anti-CHEPD antibody, wher^n the identification 
of CHEPO polypeptide fragments that possess biological acthrity or provide a binding site for an anti-CHEPO antibody may 
be accomplished in a routine manner using techniques which are well known in the art Preferably, the CHEPO fragment 
retains a qualitative biological activity of a nathre CHEPO polypeptide. 

15 In a still further aspect the invention provides a polypeptide produced by (i) hybridizing a test DMA molecule 

under stringent conditions with (a) a DNA molecule encoding a CHEPO polypeptide having the sequence of amino acid 
residues from about 1 or about 28 to about 193, inclusive, of Figure 3 (SEQ ID N0S:2 and 5), or (b) the complement of the 
DNA molecule of (a), and if the test DNA molecule has at least about an 80% nucleic acid sequence identity, alternatively 
at least about 81% nucleic acid sequence identity, altemathrely at least about 82% nucleic acid sequence identity, 

2 0 alternatively at least about 83% nucleic acid sequence identity, alternatively at least about 84% nucleic acid sequence 
identity, alternatively at least about 85% nucleic acid sequence identity, alternatively at least about 86% nucleic acid 
sequence identity, alteroatively at least about 87% nucleic acid sequence identity, altemathrely at least about 88% nucleic 
add sequence identity, alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 90% 
nucleic acid sequence identity, altemathrely at least about 91 % nudeic acid sequence identity, atonatively at least about 

25 92% midefe acid sequence identity, altematively at bast about 93% nudeic add sequence identity, alternatively at least 
about 94% nudeic add sequence identity, altematively at least about 95% nudeic add sequence identity, altematively 
at least about 96% nucUc add sequem» Identity, altematively at bast about 97% nuddc add sequemn identity, 
ahematnely at least abmit 98% nuddc add sequence Uentity and alternatively at least about 99% nuddc add sequence 
identity to (a) or (b), (o) culturing a host eel comprising tiie test DNA molecule imder conditions suitable for expresdon of 

30 ttw polypeptide, and tm) recovering the polypeptide from tin ceO culture. 

fai anotiiar embodiment tfie invention provides chimeric molecules comprismg a CHEPO polypeptide fused to a 
heterdogous pdypeptid^ or ammo acid sequence, wherem the CHEPO polypeptMe may comprise any CHEPO pdypeptide, 
variant or fragment tiiereof as hernnbefore describe! An example of sodi a diimeric molecule comprises a CHEPO 
polypeptide fused to an epitope tag sequence or a Fc region of an onmunogldiuOn. 
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In another embodiment, the invention provides an antibody as defined below which specifically binds to a CHEPO 
polypeptide as hereinbefore describei Optionally, the antibody is a monodona) antibody, an antibody fragment or a single 
chain antibody. 

In yet another embodiment the invention concerns agonists and antagonists of a native CHEPO polypeptide as 
5 defmed below. In a particular embodiment, the agonist or antagonist is an anti-CHEPO antibody or a small molecule. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists to a CHEPO 
polypeptide which comprise contacting the CHEPO polypeptide with a candidate molecule and monitoring a biological 
activity mediated by said CHEPO polypeptide. Preferably, the CHEPO polypeptide is a nathre CHEPO polypeptide. 

In a still further embodiment, the invention concerns a composition of matter compri^ng a CHEPO polypeptide, 
10 or an agonist or antagonist of a CHEPO polypeptide as herein described, or an anti-CHEPO antibody, in combination with 
a carrier. Optionally, the carrier is a pharmaceutically acceptable carrier. 

Another embodiment of the present mvention is directed to the use of a CHEPO polypeptide, or an agonist or 
antagonist thereof as herein described, or an anti-CHEPO antibody, for the preparation of a medicament useful in the 
treatment of a condition which b responshre to the CHEPO polypeptide, an agonist or antagonist thereof or an anti-CHEPO 
15 antibody. 

Yet another embodiment of the present invention is directed to CHEPO polypeptides having altered gtycosyiation 
patterns in one or more regions of the polypeptide as compared to the nathre sequence CHEPO polypeptide, preferably in 
the region surrounding and/or including amino acid position 84 in the CHEPO amino adds sequence shown in Figure 3 (SEO 
ID N0S:2 and 5). In various embodiments, CHEPO variant polypeptides are prepared using well known techniques so as 

20 to create an or O-linked glycosylatton site at or near amino add position 84 in the CHEPO polypeptide sequence. For 
example, CHEPO polypeptides contemplated by tiie present invention include those where (a) amino acids 81-84 of the 
CHEPO amino add sequence shown in Figure 3 (SEQ ID N0S:2 and 5) (Le., Met-Glu-Val-Arg; SEO 10 N0:6| are replaced by 
the amino add sequence Asn-X-Ser-X (SEQ ID N0:7) or Asn-X-Thr-X (SEQ ID N0:8), where X is any amino add except for 
Pro; (b) amino adds 82-85 of the CHEPO amino add sequence shown in Figure 3 (SEQ ID NQS:2 and 5) fte., Glu-Val-Arg- 

25 Gin; SEQ ID NO:g) are replaced by the amino add sequence Asn-X-Ser X (SEQ ID N0:7) or Asn-X-Thr X (SEQ ID N0:81, 
where X is any amino add except for Pro; (c| amino adds 83^8 of the CHEPO amino add sequence shown in Figure 3 (SEQ 
ID N0S2 ffiid S) li^ Val-Arg-Gln-GIn; SEQ ID N0:10) are replaced by the ammo add sequem» Asn-X-Sv-X (SEQ ID N0:7) 
or Asn-X-Thr-X (SEQ ID NQ:8), where X is any amino add except for Pro; or (d) amino adds 84-87 of the CHEPO amino 
add sequems shown in FQUie 3 (SEQ ID N0S2 and 5) Arg-Gb^^ 

30 seqinnce Asn-X-Ser-X (SEQ ID N0:7) or Asn-X-Thr-X (SEQ ID N0:8L where X is any amino acid except for Pro, thereby 
matins an N-glycosylatioR site at those positions. Nudeic adds encodnig these variant polypeptides are also 
contemplated herein as are vectors and host cds comprismg those 

BRIEF DESCRIPTIOM OF THE DRAWIMGS 
3 5 Figures 1 A-C show a nucleotide seaquence (SEQ ID N0:1) of an isolated genomic DNA molecule eontabnno a 

nudeotUe seiuence (mideotides 134-148, 667412, 1071-1157, 1760-1939 and 3174-2226, exduswe of others) 

7 
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encoding native sequence CHEPO. Also presented in the genomic sequence are the locations of the start codon, exons and 
introns as wed as the amino acid sequence (SEQ ID N0:2) encoded by the coding sequence of SEQ 10 N0:1. 

Figure 2 shows the cDNA sequence of the CHEPO molecule (SEQ ID NG:3) and the amino acid sequence encoded 
thereby (SEQ ID N0:2). 

5 Rgure 3 shows a comparison of the human erythropoietin amino acid sequence (human) (SEQ ID N0:4) and that 

of the chimp erythropoietin (chepo) described herein, wherein the amino acid designated X at amino acid position 142 of 
the CHEPO sequence is either glutamine (SEQ ID N0:2) or lysine (SEQ ID N0:5). 

DETAILED DESCRiPTlO W OF THE PREFERRED EMBODIMENTS 
10 I. Definitions 

The terms "CHEPO polypeptide", 'CHEPO protein" and "CHEPO" when used herein encompass nathra sequmce 
CHEPO and CHEPO polypeptide variants (wtdch are further defmed herein). The CHEPO polypeptide may be isolated from 
a variety of sources, such as from human tissue types or from another source, or prepare by rmmbinant and/or synthetic 
methods. 

15 A "native sequence CHEPO" comprises a polypeptide having the same amino acid sequence as a CHEPO derived 

from nature. Such native sequence CHEPO can be isolated from nature or can be produced by recombinant and/or synthetic 
means. The term "native sequence CHEPO" specifically encompasses naturaHy-occurrmg truncated or secreted forms {fi4f„ 
an extracellular domain sequence), naturally-occumng variant forms le.g., altemathrely spliced forms) and naturally- 
occurring allelic variants of the CHEPO. in one embodiment of the invention, the native sequence CHEPO is a mature or 

2 0 fulMengtii native sequence CHEPO comprising amino acids 1 to 1 93 of Figure 3 (SEQ ID N0S:2 and 5). Abo, whfle the 
CHEPO polypeptides disclosed in Figure 3 (SEQ ID N0S:2 and 5) is shown to begin with the methionine residue desolated 
herdn as amnio acid position 1, it is conceivable and possible that another methionine residue located either upstream or 
downstream from amino acid position 1 in Figure 3 (SEQ ID N0S:2 and 5) may be employed as tiie starting amino acid 
residue for the CHEPO polypeptide. 

2 5 "CHEPO variant polypeptide" means an active CHEPO polypeptide as defined below having at least about 80% 

amino add sequence identity witii the amino acid sequence of (a) residues 1 or about 28 to 193 of the CHEPO polypeptide 
shown in Rgure 3 (SEQ ID N0S:2 and 5). (b) X to 193 of the CHEPO polypeptide shown m Figure 3 (SEQ ID N0S:2 ami 5), 
wherein X is any amino add residue from 23 to 32 of Rgure 3 (SEQ ID N0S:2 and 5) or (c) anotiier spedficaOy derived 
fragment of tin ammo add sequence shown in Rgure 3 (SEQ ID mS2 and 5). Sudi CHEPO variant polypeptides indude, 
30 for instance, CHEPO polypeptides wherem one or more ammo add residues are added, or deleted, at tiie N- and/or C- 
terminus, as well as wMmi one or mora internal domains, of tiia sei^ence of Rgure 3 (SEQ ID N0S:2 and S). Ordinarily, 
a CHEPO variam polypeptide vviO have at least d»ut 80% ante 

amino aehl sequence identity, aitematively at least about 82% amino add sequencejdentity, atonatively at least about 
83% amino add sequence identity, altemativdy at least about 84% amino add sequence identity, dtemativefy at least 

3 5 about 85% ammo add sequence identity, aitematively at least about 86% ammo add sequence identity, altemativdy at 

bast dimit 87% anm add sequence Mentity, aheniativdy at 1^^ 
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at least about 89% amino acid sequence identity, alternatively at least about 90% amino acid sequence identity, 
alternatively at least about 91% amino add sequence identity, alternatively at least about 92% amino acid sequence 
identity, alternatively at least about 93% amino acid sequence identity, alternatively at least about 94% amino acid 
sequence identity, alternatively at least about 95% amino acid sequence identity, alternatively at least about 98% amino 
5 acid sequence identity, alternatively at least about 97% amino acid sequence identity, alternatively at least about 98% 
ammo acid sequence idoitity and altematively at least about 99% amino acid sequence identity with (a) residues 1 or about 
28 to 193 of the CHEPO polypeptide shown in Figure 3 (SEQ ID N0S:2 and 5), (b) X to 193 of the CHEPO polypeptide 
shown in Rgure 3 (SEQ ID N0S:2 and SL wherein X is any amino acid residue from 23 to 32 of Rgure 3 (SEQ ID N0S:2 
and 5) or (c) another specirically derived fragment of the amino acid sequence shown in Hgure 3 (SEQ ID N0S:2 and 5). 

10 CHEPO variant polypeptides do not encompass the native CHEPO polypeptide sequence. Ordinarily, CHEPO variant 
polypeptides are at least about 10 amino adds in length, altematively at least about 20 amino acids in length, altemathrely 
at least about 30 amino acids in length, atternatraely at least about 40 amino acids in length, alternatively at least about 
50 amino acids in length, altematively at least about 60 amino acids in tength, altematively at least about 70 amino acids 
ai length, alternathrely at least about 80 amino adds in length, alternatively at least about 80 amino acids in length, 

15 alternathrely at least about 100 amino adds in length, altematively at least about 150 amino acids in lengtii, alternathrely 
at least about 200 amino acids in length, altematively at least about 300 amino acids in length, or more. 

"Percent (%) amino add sequence identity" with respect to the CHEPO polypeptide sequences identified herem 
is defined as the percentage of amino add residues in a candidate sequence tiiat are identical witii die amino add residues 
m a CHEPO sequence, after allgnmg the sequences and introducing gaps, if necessary, to achieve the maximum percent 

20 sequence identity, and not considering any conservative substitutions as part of the sequence identity. AKgnment for 
purposes of determining perrant amino add sequence identity can be achieved in various ways tiiat are within the skill in 
tiie art for instance, using pubficly available computer software such as BLAST, BLAST-2, ALIGN, ALiGN*2 or Megalign 
(DNASTAR) software. Those skilled in the art can determine appropriate parameters for measuring alignment, including 
any algorithms needed to achieve maximal afignment over tiie full-lengtfi of tiie sequences being compared. For purposes 

2 5 herein, however, % amino add sequence identity vahies are obtained as described below by usmg tiie sequence comparbon 

computer program AUGN>2, wherein tiie complete source code for tiie ALIGN-2 program is provided in Table 1 below. The 
ALlGN-2 sequence comparison computer program was autiiored by Genentech, Inc. and tiie source code shown in Table 
T has been filed witii user documentation in tiie U.S. Copyright Office. Washington D.C., 20559, whero it is registered 
tuider U^. Copyright Registration No. TXU510087. The AUGN'2 program is publicly available tiirough Genentedi, Inc., 

3 0 Sooth San Frandsco, Cdfomia or may be compaed from the souroe code provided in Table 1. Tin AUGN-2 progrom should 

be compSed for use on a UNIX qserating system, preferably digital UNIX V4.0D. AH sequence comparison parameters are 
set by tiio AU6N-2 program and do not vary. 

For purposes herein, die % ammo add sequence Uentity of a ghren amino add sequmce A to, withf or against 
a ghren amino add sequence B (which can alternatively be phra^ 
35 a certain % amino add sequence identity to, with, or against a ghren anino add sequence B) Is cakuiated as f oDows: 
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100 times the fraction X/Y 

where X is the number of amino acid residues scored as identical matches by the sequence alignment program ALIGN'2 
in that program s alignment of A and B, and where Y is the total number of amino acid residues In B. it will be appreciated 
that where the length of amino add sequence A is not equal to the length of amino acid sequence B, the % ammo acid 
sequence identity of A to B win not equal the % amino add sequence identity of B to A. As examples of % amino add 
sequence identity calculations, TaUes 2 and 3 below demonstrate how to calculate the % amino add sequence identity 
of the amino add sequence designated Comparison Protein to the amino acid sequence designated PRO . 

Unless specifically stated otherwise, all % ammo acid sequence id«itity values used herein are obtained as 
described above using the AlJGN-2 sequence comparison computer program. However, % ammo add sequence identity 
may also be determined using the sequence comparison program NCBI-BLAST2 (Altschul at at.. Nucleic Adds Res. 
25:3389-3402 (19971). The NCBI-BLAST2 sequence comparison program may be downloaded from 
http://wwwjicbLnlnLnih.gov or otherwise obtained from the National Institute of Health, Bethesda, MD. NCBI-BLAST2 
uses several search parameters, wherein all of those search parameters are set to default values induding, for example, 
unmask - yes, strand - all, expected occurrences - 10, minimum low complexity length - l5/5, multi-pass e-value - 
0.01, constant for multi-pass - 25, dropoff for final gapped alignment • 25 and scoring matrix - BL0SUM62. 

In situations where NCB1BLAST2 is employed for amino add sequeiu:e comparisons^ the % amino acid sequence 
identity of a given amino add sequence A to, with, or against a given amino acid sequence B (which can alternatively be 
phrased as a given amino acid sequence A that has or comprises a certain % amino add sequence identity to, with, or 
against a ghren amino add sequence B) is calculated as follows: 

100 times the fraction X/Y 

where X is the number of amino add reddues scored as identical matches by the sequence elignment program NCBI- 
BIAST2 in that program s alignment of A end B, and where Y is the total number of amino add residues in B. It will be 
appredated that where the length of amino add sequence A is not equal to the length of amino add sequence B, the % 
amino acid sequence identity of A to B will not equal the % amino acid sequence identity of B to A. 

*CHEPO variant polynucleotide" or CHEPO variant nucleic add sequence means a nuddc add molecule which 
encodes an active CHEPO polypeptide as deftnad below and which has at hast about 80% ntideic add sequence identity 
witfi ddier (a) a nuddc add sequence wMch encodes reddues 1 or about 28 to 183 of the CHEPO polypeptide shown in 
Figure 3 (SEQ ID N0S2 ond SL (b) a nuddc add sequence which encodes rosidues X to 193 of the CHEPO polypeptide 
shown in Rgure 3 (SEQ ID NGS:2 and 5), wherein X is any ammo add reddue from 23 to 32 of Figure 3 (SEQ ID N0S:2 
and 5) or (cl a nuddc add sequence which encodes anotfieripedfiailly derived fragment of the amino add sequence 
shown in Figure 3 (SEQ ID NOS^ and 5). Onfinarily, a CHEPO variant polynucleotide will have at least about 80% nudeic 
add sequence identity, dtematively at least about 81% nudete add sequence Mentity, altemathrely at least about 82% 
mtddc add sequence UBntity, aiteraatively at least about 83% nudeic add sequence identity, alternatively at bast about 

10 
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84% nucleic acid sequence identity, alternatively at least about 85% nucleic acid sequence Identity, alternatively at least 
about 86% nucleic acid sequence identity, alternatively at least about 87% nucleic acid sequence identity, alternatively 
at least about 88% nucleic acid sequence identity, alternatively at least about 89% nucleic acid sequence identity, 
alternatively at least about 90% nucleic acid sequence identity, alternatively at least about 91% nucleic acid sequence 
identity, alternatively at least about 92% nucleic add sniuence identity, alternatively at least about 93% nucleic acid 
sequence Uentity, alternatively at least about 94% nucleic add sequence identity, alternatively at least about 95% nucleic 
add sequence Identity, alternatively at least about 96% nucleic acid sequence identity, alternatively at least about 97% 
nucleic add sequence identity, alternatively at least about 98% nuddc add sequence identity and alternatively at least 
about 99% nucleic add sequence Identity with either (a) a nucleic acid sequence which encodes residues 1 or about 28 
to 193 of the CHEPO polypeptide shown in Figure 3 (SEQ ID N0S:2 and 5), (b) a nucleic acid sequence whrch encodes 
residues.X to 193 of tin CHEPO polypeptide shown in Fqure 3 (SEQ ID N0S:2 and 5), wherein X is any amino add residue 
from 23 to 32 of Rgure 3 (SEQ ID N0S:2 and 5) or (c) a nucleic add sequence which encodes another spedfically derhred 
fragment of the amino acid sequence shown in Figure 3 (SEQ ID N0S:2 and 5K CHEPO polynucleotide variants do not 
encompass the native CHEPO nucleotide sequence. 

Ordinarily, CHEPO variant polynucleotides are at least about 30 nudeotides in length, altemathrely et least about 
80 nucleotides in length, alternatively at least about 90 nucleotides in length, altemativeiy at teast about 120 nudeotides 
in length, alternatively at least about 150 nucleotides in length, alternatively at least about 180 nudeotides in length, 
alternatively at least about 210 nucleotides in lengti), alternatively at least about 240 nudeotides in length, ahemathrely 
at least about 270 nudeotides m lengtii, altemativeiy at least about 300 nudeotides in lengtii, altemativeiy et least about 
450 nudeotides in lengtii, altemativeiy at least about 600 nucleotides in length, atonatively at least about 900 
nudeotides in lengtii, or more. 

"Percent (%) nucleic add sequence identity" with respect to the CHEPO polypeptide-encoding nudeic acid 
sequences identified herein is defined as tiie percentage of nucleotides in a candidate sequence tiiat are identical witij the 
nudeotides in a CHEPO polypeptide-encoding nudeic acid sequence, after aligning tiie sequences and introdudng gaps, if 
necessary, to achieve tiie maximum percent sequence identity. Alignment for purposes of determining percent nudeic acid 
sequence Identity can be achieved m varniis ways tiiat are witiun tiie skill in tiie art for instance, usmg pubGdy avadable 
computer software sudi as BLAST, BLAST.2, AU8M AUGN-2 or MegaGgn (ONASTAR) software. Those sknied in tiie ert 
can deteraiine appropriate parameters for measuring aOgnment induding any algoritiuns needed to achieve maximal 
aBgnment over tiie fulHengfli of the sequences being compared. For purposes herein, however, % nucleic acid sequence 
identity vahies are obtamed as described below by using the sequence comparison computer program ALI6N-2, wherein 
the compbte source code for the ALI6N-2 progran is provided In Table 1. The AU6N-2 sequence comparison conqiuter 
program was audiored by Genentedi Inc. and the source code shovvn in Table 1 has been filed witfi user documentation 
to the U.S. Copyright Offno, Washington D.C., 20559, where it is registered under U.S. Copyright Registration No. 
TXU5100B7. The AUGN-2 program is poUicly available tivough Oenentedi, Inc Soutfi San Frandsco, Carrfomia or may 
be compiled from tiie source code provided in Table 1. The ALIQV-2 program shoidd be compiled for use on a UNIX 
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operating system, preferably digital UNIX V4.0D. All sequence comparison parameters are set by the AU6N-2 program 
and do not vary. 

For purposes herein, the % nucleic acid sequence identity of a given nucleic acid sequence C to, with, or against 
a given nucleic acid sequence D (which can alternatively be phrased as a given nucleic acid sequence C that has or 
5 comprises a certain % nucleic acid sequence identity to, with, or against a given nudetc acid sequence D) is calculated as 
follows: 

100 times the fraction W/Z 

1 0 where W is the number of nucleotides scored as identical matches by the sequence alignment program ALIGN-2 in that 
program s alignmoit of C and D, and where Z is the total number of nucleotides in 0. It will be appreciated that where 
the length of nucleic add sequence C is not equal to the length of nucleic acid sequence D, the % nudeic add sequence 
identity of C to D wHI not equal the % nudeic acid sequence identity of D to C. As examples of % nudeic acid sequence 
identity calculations, Tables 4 and 5 below demonstrate how to calculate the % nucleic acid sequence identity of the 

15 nudeic acid sequence designated Comparison ONA to the nudeic acid sequence designated PRO-DNA . 

Unless specifically stated otherwise, all % nucleic add sequence identity values used herein are obtained as 
described above using the ALIGN-2 sequence comparison computer program. However, % nudeic add sequence identity 
may also be determined using the sequence comparison pregram NCBI-BLAST2 (Altschul et a!.. Nucleic Adds Res. 
25:3389-3402 (1997)). The NCBIBLAST2 sequence comparison pregram may be downloaded from 

2 0 http://www.nd)i.nlnLnih.gov or otherwise obtained from the National faistitute of Health, Bethesda, MD. NCBI-BLASt2 

uses several search parameters, wherdn all of those search parameters are set to default values induding. for example, 
unmask - yes, strand - all, expected occurrences - 10, minimum low complexity length - 15/5, multi-pass e-valun - 
0.01, constant for multi-pass - 25, dropoff for final gapped alignment - 25 and scoring matrix - BLOSUM62. 

In situations where NCBI-BLAST2 is employed for sequence comparisons, the % nucleic add sequence identity 
25 of a given nudeic add sequence C to, with, or against a gwen nucleic add sequence D (whidi can altemathrely be phrased 
as a g»en nudeic add sequence C that has or comprises a certain % nudeic add sequence identity to, with, or against 
8 given nucleic acid sequence D) is calculated as follows: 

lOOtimes the fraction WfZ 

30 

where W is the mimber of ninleothles scored as identical matdies by the sequence afignment program NCBI-BLAST2 in 
that program s aigiuiient of C and D; and wtere Z is the total number of nudeotnles m D. it wOl be amvectated that where 
the length of nudeic add sequence C is not equal to the length of mideic add sequence D, the % nuddc acid sequenice 
tdemitv of C to D wB not equal the % micle» add sequence Menfity of ^ 

3 5 In other embodiments. GHEPQ variant polynudeotides are nudeic add mdecutes that em»de an active CHEPO 

polypeptide and whidi are capable of hybriiBziog, preferably under stringent hybridization and wash conditions, to 
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nucleotide sequences encoding the fulWength CHEPO polypeptide shown in Rgure 3 ISEQ ID N0S:2 and 5). CHEPO variant 
polypeptides may be those that are encoded by a CHEPO variant polynucleotide. 

The term "positives", in the context of the amino acid sequence identity comparisons performed as described 
above, includes amino acid residues in the sequences compared that are not only identical, but also those that have similar 
5 properties. Amino acid residues that score a positive value to an amino acid residua of interest are those that are either 
identical to the amino acid residue of interest or are a preferred substitution (as defined in Table 6 below) of the amino acid 
residue of interest. 

For purposes hereia the % value of posttwes of a given amino acid sequence A to, with, or against a given amino 
acid sequence 6 (which can alternatively be phrased as a given ammo acid sequence A that has or comprises a certain % 
1 0 positives to, with, or against a gnren amino acid sequence B) is calculated as follows: 

100 times the fraction X/Y 

where X is the number of amino add residues scoring a posithre vahie as defined above by the sequence alignment program 

1 5 ALIGN-2 in that program s alignment of A and B, and where Y is the total number of amino acid residues in B. It will be 
appreciated that where the length of amino acid sequence A is not equal to the length of amino acid sequence B, the % 
positives of A to B will not equal the % positives of B to A. 

"Isolated," when used to describe the various polypeptides disclosed herein, means polypeptide that has been 
identified and separated and/or recovered from a componott of its natural environment Preferably, the isolated polypeptide 

20 is free of association with all components with which it is naturally associated. Contaminant components of its natural 
environment are materials that would typically interfere with diagnostic or tiierapeutic uses for the polypeptide, and may 
include enzymes, hormones, and other proteinaceous or non-proteinaceous solutes. In preferred embodiments, the 
polypeptide will be purified (1) to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid 
sequence by use of a spinning cup sequenator, or (2) to homogeneity by SOS-PAGE under non-reducing or reducing 

25 conditions using Coomassie blue or, preferably, silver stain, isolated polypeptide includes polypeptide in situ m^m 
recombinant cells, since at least one component of tiie CHEPO natural environment will not be present Ordinarily, 
however, isolated polypeptide wdl be prepared by at least one purification st^ 

An "isolated" nudeic add orahcuie encoding a CHEPO polypeptide is a nudeic acid molecule tiiat is identified 
and separated from at bast one contammant mideic add molecule with which it is ordinarily assodated in tiie natural 

30 sourraoftheCHB'Owodingnuddcadd. Preferably, the isdatedmadeic is free of assodationvvirti^ 

whidi it B naturally assodatei An isolated CHEPO-encoiEng nudeic add mohaib is other tlian in the form or setting in 
vvhich it is found in nature, bdated nuddc add mobcules tiieref ore are distinguished from the CHEPO^coding nudeic 
acid motocuta as it axbts fai natural calls. However, an bdated nuddc add mobcule racoding a CHEPO polypeptide 
tnchides CHEP&oicodmg nuddc add mobcubs contained in eels that ordinarily express CHEPO where, for exampbr the 

3 5 nuddc add mdecub b in a chromosomal tacation (Efferent from tiiat of naturd cells. 
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The term "control sequences" refers to DNA sequences necessary for the expression of an operably linked coding 
sequence in a particular host organism. The control sequences that are suitable for prokaryotes, for example, include a 
promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic celts are known to utilize promoters, 
polyadenylation signals, and enhancers. 
5 Nucleic acid is "operably Imked" when it is placed into a functional relationship with another nucleic acid 

sequence. For example, DNA for a presequence or secretory leader is operably linked to ONA for a polypeptide if it is 
expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding 
sequence if it is positioned so as to f acState translatioiL Generally, "operably rmked" means that the DNA sequences being 

1 0 linked are contiguous, and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers do not 
have to be contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, tiie 
syntiietic oligonucleotide adaptors or Imkers are used in accordance with conventional practice. 

Tim term "antibody" is used in the broadest sense and specifically covers, for example, sirqie anti-CHEPO 
monoclonal antibodies Tmchiding agonist, antagonist, and neutrafizing antibodies), anti-CHEPO antibody compositions with 

15 polyepitopic spedficity, stngte chain anti CHEPO antiboiies, and fragments of anti-CHEPO antibodies (see below). The term 
"monoclonal antibody" as used herein refers to an antibody obtained from a population of substantially homogeneous 
antibodies, Le., the indhridual antibodies comprising the population are identical except for possible naturally-occurrtng 
mutations that may be present in minor amounts. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art and generally 

20 is an empirical calculation dependent upon probe lengtii, washing temperature, and salt concentration. In general, longer 
probes require higher temperatures for proper annealing, while shorter probes need lower temperatures. Hybridization 
generally depends on tiie ability of denatured DNA to reanneal when complementary strands are present in an environment 
below tiieir melting temperature. The higher tfie degree of desired homology between the probe and hybridizable sequence, 
tiie higher the relathre temperature which can be usei As a result, it f oBows tiiat higher relative temperatures would tend 

2 5 40 make the reaction conditions more stringent, while lower temperatures less so. For additional details and explanation 
of stringency of hybridization reactions, see Ausubel et a/^ Current Protocols m Molecular Biolnnv. Wiley Interscience 
Publishers, (1995). 

"Stringent conditions" or "high stringency conditions", as defined herein, may be identiTted by tiiose tiiat (1) 
employ tow hinic strengtii and high temperature for washing, for example 0.015 M sodium chloride/0.0015 M sodium 

30 citr8tefll.1% sodium dodecyl sulfate at 50 C; (2) employ during hybridization a denaturing agent, such as formamide, for 
enmpbr 50% (vM formamide with 0.1% bovnw senan albuminfO.1% FicoH/0.1 % potyvinylpyrrolidone/SOmM sodium 
phosphate buffa at pH B.5 with 750 mM sodhim chloridi, 75 nM sodium citrate at 42 C; or (3) employ 50% formamide, 
5 X SSC (0.75 M NaCl 0.075 M sodhim duate), SO inM sodium phosphate (pH B.B);0.1 % sodhmi pyrophosphate, 5 x 
Denhardt s solution, sonicated salmon sperm DNA (SO gfmlL 0.1% SOS, and 10% dextran sulfate at 42 C. with washes 

35 at 42 C in 0.2 X SSC (sodium chforidefsodhmi citrate) ami 50% fonnamide at 55 C, followed by a high^stringency wash 
coittistmg of 0.1 X SSC containing EDTA at 55 C. 
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"Moderately stringent conditions" may be identified as described by Sambrook et bI., Molectilar Cloninn: A 
Laboratory Manual. New York: Cold Spring Harbor Press, 1989, and include the use of washing solution and hybriifization 
conditions (e.g., temperature, ionic strength and %SDS) less stringent that those described above. An example of 
moderately stringent conditions is overnight incubation at 37 C in a solution comprising: 20% formamide, 5 x SSC (150 
5 mM NaCl IS mM trisodium duate), 50 mM sodium phosphate (pH 7.6k 5 x Denhardt s solution, 10% dextran sulfate, and 
20 mg/ml denatured sheared salmon sperm DMA, ffoDowed by washing the filters in 1 x SSC at about 37-50 C. The skilled 
artisan will recognize how to adjust the temperature, ionic strength, etc. as necessary to accommodate factors such as 
probe length and the like. 

The term "epitope tagged" when used herdn refers to a chimeric polypeptide comprising a CHEPO polypeptide 

1 0 fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope against which an antibody 
can be made, yet is short enough such that it does not interfere with activity of the polypeptide to wNch it is fused. The 
tag polypeptide preferably also is f affty unique so that the antibody does not substantiany cross-react with other epitopes. 
Suitable tag polypeptides generally have at least six anano acid residues and usually between about 8 and 50 amino acid 
residues (preferably, between about 10 and 20 amino acid residues). 

IS As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the binding 

specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin constant domains. 
Stnicturally, the immunoadhesins comprise a fusion of an amino acid sequence with the desired binding specificity which 
is other than the antigen recognition and binding site of an antibody (i.e., is "heterologous"), and an immunoglobulin 
constant domain sequence. The adhesin part of an immunoadhesin molecule typically is a contiguous amino acid sequence 

20 comprising at least the binding site of a receptor or a ligand. The immunoglobulin constant domain sequence in the 
immunoadhesin may be obtained from any immunoglobulin, such as ig6-1, lgG*2, lgG-3, or Ig64 subtypes, IgA (including 
lgA-1andlgA-2),lgE,lgDorlgM. 

"Active" or "activity" for the purposes herein refers to form(s) of CHEPO which retain a biological and/or an 
immunological actWity of nathre or naturally-occurring CHEPO, wherein biological activity refers to a biological function 

25 (either inhibitory or stimulatory) caused by a nathre or naturally-occurring CHEPO other than the ab'dity to induce the 
production of an antibody against an antigenic epitope possessed by a nathre or naturally-occurring CHEPO and an 
immunob^cal activity refers to the abiTrty to induce the production of an antibody against an antigenic epitope possessed 
by a native or naturafly-occurring CHEPO. Preferred Uological activities uictudes, for example, tiie abiity to regulate red 
blood eel production, to bind to receptors on tiie surface of committed progenitor ceOs of the bone marrow and/or other 

3 0 hematopoietic tissues andlor to mduce proliferation and/or tennmal maturation of erytiiroid cells. 

The tenn "antagonist' is used in the broadest sense, and inchides any molecule that partially or fuBy blocks, 
inhirtSr or neutrals a biologicd activity of a native CHEPO polypeptide iBsdosed herrin. In a simOar niaiu»r, die term 
"agonist" is used in the broadest sense and inchides any molecule tiiat ndndcs a Uological activity of a native CHEPO 
polypaptide disdoud herem. Suit^le agonist or antagoinst molecules sptedficaDy include agonist or antagonist antibodies 

35 or antibody fragments, fragments or amino add sequence variants of nathre CHEPO polypeptides, peptides, small organic 
molecules, etc Methods for identififing agonists or antagmists of a CHEPO polypeptide may comprise contacting a 
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CHEFO polypeptidG with a candidate agonist or antagonist molscuie and measuring a detectable change in one or more 
biological activities normally associated with the CHEPO polypeptide. 

"Treatment" refers to both therapeutic treatment and prophylactic or preventative measures, wherein the object 
is to prevent or slow down (lessen) the targeted pathologic condition or disorder. Those in need of treatment include those 
5 already with the disorder as well as those prone to have the disorder or those in whom the disorder is to be prevented. 

"Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed to an acute 
mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. Intermittent administration 
is treatment that is not consecutively done without interruption, but rather is cyclic in nature. 

"Mammal" for purposes of treatment refers to any animal classified as a mammal, including humans, domestic 
10 and farm animals, and zoo, sports, or pet animals, such as dogs, cats, cattle, horses, sheep, pigs, goats, rabbits, etc. 
Preferably, the mammal is human. 

Administration "in combination with" one or more further therapeutic agents includes simultaneous (concurrent) 
and consecuthre administration in any ordm, 

"Carriers" as used herein include pharmaceutically acceptable carriers, excipients, or stabilizers which are 
15 nontoxic to the ceD or mammal being exposed thereto at the dosages and concentrations employed. Often the 
physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically acceptable carriers 
include buffers such as phosphate, citrate, and other organic acids; antioxidants including ascorbic add; low molecular 
weight (less than about 10 residues) polypeptide; proteins, such as s^ albumin, gelatin, or unmunoglobulins; hydrophiGc 
polymers such as polyvinyipyrroGdone; ammo acids such as glycine, glutanune, asparagine. arginina or lysme; 

2 0 monosaccharides, disaccharides, and other carbohydrates including gliuose, mannose, or dextrins; chelatiiq agents such 

as EDTA; sugar alcohols such as mannitol or sorbitol; salt*forming counterions such as sodium; and/or nonioiric surfactants 

such as TWEEN , polyethylene glycol (PEG), and PLURONICS . 

"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or variable region 

of the intact antibody. Examples of antibody fragments inchide Fab, Fab*, F(ab')2, and Fv fragments; diabodies; linear 
25 antibodies (Zapata et aL, Protein Eno. 8(10): 1057*1062 [1995]); singie*chain antibody molecules; and multispecific 

antibodies formed from antibody fragments. 

Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" fragments, each 

with a single antigen-binding site, and a residua) "Fc" fragment, a designation reflecting the abSity to crystallize readily. 

Pepsin treatment yields an Rab')} fragment that has two antigen-combimng sites and is still capable of cross-Gnking 
30 antigen. 

"Fv" is the minhuon antibody fragment which contains a complete antigen-recognhion and -bincBng site. This 
region consist of a dimer of one heavy- and one Ight-chain variable domain in tight non-covalent assodatioa It is m this 
configuration that the three CDRs of each variable domain interact to deTme aii ant^en-binding site on the surface of the 
Vh-Vl dimer. Collectively, the six CDRs confer antigen-binding specificity to the antSiody. However, even a singfa varUile 

3 5 domain (or half of an Fv comprising only three CORs specific for an antigen) has the ability to recognize and bind antigeii, 

although at a lower affinity than the entire Undiog site. 

16 . . 
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The Fab fragment also contains the constant domain of the light chain and the first constant domain (CHI) of 
the heavy chain. Fab fragments differ from Fab fragments by the addition of a few residues at the carboxy terminus of 
the heavy chain CHI domain including one or more cysteines from the antibody hinge region. Fab'-SH is the designation 
herein for Fab' in which the cysteine residue(s) of the constant domains bear a free thiol group. Fiab'lj antibody fragments 
5 originally were produced as pairs of Fab* fragments which have hinge cysteines between them. Other chemical couplings 
of antibody fragments are abo known. 

The "fight chains" of antibodies Ommunoglobulins) from any vertebrate species can be assigned to one of two 
clearly distinct types, called kappa and lambda, based on the amino acid sequences of their constant domains. 

Dependmg on the amino acid sequence of the constant domain of their heavy chains, immunoglobuSns can be 
10 assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG, and IgM, and several 
of these may be further dnided into subclasses (isotypes), e.g., IgGI, lgG2, lgG3, lgG4, IgA, and lgA2. 

"Single-chain Fv" or "sFv" antibody fragments comprise the VH and VL domains of antibody, wherein these 
domains are present in a single polypeptide chain. Preferably, the Fv polypeptide further comprises a polypeptide linker 
between the VH and VL domains which enables the sFv to form the desired structure for antigen binding. For a review of 
15 sFv, see Pluckthun in The Pharmacology of Monoclonal Antibodies, vol. 1 13, Rosenburg and Moore eds., Springer Verlag, 
New York, pp. 269-315 (1994). 

The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which fragments 
comprise a heavy-chain variable domain (VH) connected to a light-chain variable domain (VL) in the same polypeptide chain 
(VH • VL). By using a linker that is too short to allow pairing between the two domains on the same chain, the domains 
20 are forced to pair with the complementary domains of another chain and create two antigen-binding sites. Diabodies are 
described more fully in. for example, EP 404.097; WO 93/1 1 1 61; and Hollinger et aL, Proc NatL Acad. ScL USA, 90:6444^ 
6448(1993). 

An "isolated" antibody is one which has been identified and separated and/or recovered from a component of its 
natural environment. Contaminant components of its natural environment are materials which would interfere with 

25 diagnostic or therapeutic uses for the antibody, and may include enzymes, hormones, and other proteinaceous or 
nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1) to greater than 95% by weight of 
antibody as determined by the Lowry method, and most preferably more than 99% by weight, (2) to a degree sufficient 
to obtain at bast 15 residues of N-terminai or internal amino add sequence by use of a spinning cup sequenator, or (3) to 
homogeneity by SOS-PAGE under reducing or nonredudng conditions using Coomassie blue or, preferably, silver stain. 

3 0 Isolated antftody indudes the antibody in situ within recombuiant ceDs since at least one component of the antibody's 
natural environment will not be present Ordinarily, however, isolated antibody virill be prepared by at bast one purification 

The word "label" when used herein refers to a detectabb compounti orcomposition which is conjugated Srectiy 
or indirectly to the antibody so as to generate a "bbebd" amibqdy. The bbel may be detectabb by Hself (e^. radiobotope 
35 bbeb or fluorescent bbeb) or, in the case of an enzymatic bbel may catalyze chemical alteration of a substrate compoumi 
or composition wlibh b detmrtabb. 
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By "solid phase" is meant a non-aqueous matrix to which the antibody of the present invention can adhere. 
Examples of solid phases encompassed herein inckide those fomied partially or entirely of glass (e.g^ controlled pore glass), 
polysaccharides (e.9., agarose), polyacrylamides, polystyrene, polyvinyl alcohol and silicones. In certain embodiments, 
depending on the context, the solid phase can comprise the weD of an assay plate; in others it is a purification column (e.g., 
5 an affinity chromatography column). This term also includes a discontinuous solid phase of discrete particles, such as 
those described in U.S. Patent Ho. 4,275,149. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant whfeh is 
useful for delivery of a drug (such as a CHEPO polypeptide or antibody thereto) to a mammal The components of the 
Iqiosome are commonly arranged in a bilayer formation, simflar to the lipid arrangement of biological membranes. 
10 A small molecule is defined herebi to have a molecular weight below about 500 Daltons. 
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Tahhl 

* C'C increased from 12 to 15 
5 * Z is average of EQ 

6 is average of ND 

* match with stop is _M; stop stop - 0; J Qoker) match - 0 
•/ 

#define _M -8 /* value of a match with a stop */ 

10 

Int Jayt261[26| - { 

r ABCDEFGHIJKLMNOPQRSTUVWXYZ*/ 





/•A'/ 


{ 2. 0.-2. 0. 0.4, 0.-1,-2.1. 0,_M. 1. 0.-2. 1. 1. 0. 0.-6. 0.-3. 0}. 




/•B*/ 


{ 0, 3.4, 3. 2.-5. 0. 1.-2. 0. 0.-3.-2. 2,_iyi.-1. 1, 0. 0. 0. 0.-2,-5. 0.-3. 1}. 


15 


rc*/ 


{•2,4.1 5.-5,-5,4.-3,-3.-2. 0,-5.-6.-5.4._M.-3.-5.4. Q. Z 0,-2.-8. 0. 0.-5}. 






{0, 3.-5, 4. 3,-6. 1, 1.-2. 0. 0.4.-3. 2,_M.-1. 2.-1. 0. 0. 0,-2.-7, 0.4, 2}. 




/•£•/ 


{ 0, 2.-5, 3. 4,-5. 0. 1,-2. 0, 0.-3.-2. 1,_M,-1. Z-1, 0. 0. 0.-2.-7, 0.4. 3}. 




I'fV 


{4.-5.4,-8,-S, 9,-5,-2, 1. 0.-5. 2. 0.4,_M..5.-5.4,-3,.3, 0.-1, 0, 0, 7,.S}, 




I'BV 


{ 1, 0.-3. 1. 0.5. 5.-2,-3. 0,.2.4.-3. 0._y..l..1,.3, 1. 0. 0.-1,7, 0,5, 0}, 


20 


I'HV 


{•1, 1,-3, 1. 1,-2,-2, 6,-2. 0, 0,-2.-2.2._M.0.3,2,-l,-l.fl,-2,-3.0.0,2). 




r\'i 


{•1.-2.-2,-2,-2, 1,-3,-2, 5, 0,-2, 2. 2,-2,_M,-2,-2,-2,-1. 0, 0, 4.-5, 0.-1.-2). 




m 


{ 0, 0, 0, 0, 0. 0, 0, 0, 0, 0. 0. 0. 0. 0,_M, 0, 0, 0. 0, 0, 0, 0, 0, 0, 0. 0}, 




rK'i 


{•1, 0,-5, 0, 0,-5,-2, 0,-2. 0. 5,-3, 0, 1,_M,-1, 1, 3, 0, 0, 0,-2,-3, 0,4, 0}, 




riv 


{-2,-3,.6,4,-3, 2,4,-2. 2. 0.-3, 6, 4,.3,_M,-3,-2,-3,-3,-1, 0, 2.-2. 0,-1,-2}, 


25 




{•1.-2.-5.-3,-2. 0.-3.-2. 2. 0. 0, 4. 6,-2._M.-2,-1. 0.-2.-1. 0. 2,4, 0.-2.-1}. 






{ 0. 2.4. 2, 1.4. 0, 2.-2. 0. 1,-3,-2. 2._M.-1, 1, 0, 1. 0, 0.-2,4. 0.-2. 1}. 




ro'i 


{_M._M,_M._M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_N10,_M,_M,_M,>,_M._II*_^^ 






{ 1.-1.-3,-1 ,-1,-5,-1, 0,2, 0,-1.-3.-2.-l,_M, 6, 0, 0, 1, 0, 0.-1,-6. 0,-5. 0}. 




rav 


{ 0, 1.-5. 2. 2,-5,-1, 3,-Z 0. 1,-2.-1, 1,_M, 0, 4. 1.-I.-1, 0,.2..5, 0.4, 3}, 


30 




{2, 0.4.-1.-1,4,-3, 2.-Z 0. 3.-3. 0. 0._M. 0, 1, 6, 0.-1, 0,-2, 2. 0,4. 0}. 




rsv 


{ 1. 0. 0. 0. 0.-3, 1,1,-1, 0. 0.-3.-2, 1._M, 1.1, 0, 2. 1, 0,-1,.2, 0,3. 0}. 




/•!•/ 


{ 1. 0.Z 0. 0.-3, 0,-l. 0, 0, 0,-1,-1, 0._M. 0,-1.-1, 1, 3, 0, 0,-5. 0,-3, 0}. 




ru'i 


{ 0, 0, 0, a 0, 0. 0, 0, 0, 0, 0. 0, 0, 0._M. 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}. 




rvv 


{ 0,-2,-2, Z-2.1.1. Z 4, 0, 2. 2, 2,-2,_M, l,-2,-2.-1. ft ft 4.-6, 0,-2,-2}, 


35 


rwf 


{•6,-5,-8,7,-7, ft.7,.3.-5, 0.-3,-2,4,4,^M,.8,-S, llS. 0,-ft17, 0, ft-O), 




rx*/ 


{ftftftftftftftftaaftftftft_M,ftftftftftftftO,Q,0.0}. 
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/• Y •/ {■3,-3, 0,4,-4. 7,-5, 0,1, 0,4,-1,-i-2,_M,-5,4,4,-3,-3, 0,-2, 0, 0,10,4}, 
;• Z */ { 0, l,-5, 2, 3.-5, 0, 2.-2. 0, 0,-2.1, 1,_M. 0. 3, 0. 0. 0. 0.-2,-6, 0,4, 4} 

}; 
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Table 1 (cont) 

r 

^include <stdio.h> 
5 #include <ctype.h> 



10 



15 



Mgftne MAXJMP 


16 


r max jumps in a diag */ 


Mefine MAXGAP24 


/* don't cgntiiHie to penalize gaps larger than this */ 


Mafine JMPS 


1024 


/"maxjmpsinanpath*/ 


^'define MX 


4 


1* save if there's at least MX-1 bases since 


#define OMAT 


3 


/* value of matching bases */ 


Mefine OMIS 


0 


/* penalty for mismatched bases *f 


Mefine DINSO 


8 


/'penalty for a gap*/ 


AMina 0INS1 


1 


/* penalty per base*/ 


Mefine PINSO 


8 


/* penalty for a gap*/ 


#define PINS1 


4 


/* penalty per residne */ 



iastjmp */ 



struct^ { 
20 short 



nIMAXJMPl- 



unsigned short x[MAXJMPl; 



rsizeof jmpCneg fordely) */ 
r base no. of jmp in seqx*/ 
/' limits seq to 2" 16-1*/ 



25 



30 



struct diag { 
int 
long 
short 
struct jmp 

); 



struct path { 
int 



35 }; 



score; /* score at last jmp */ 

offset; /* offset of prev block */ 

ijmp; Tcorrent jmp index*/ 

jp; /*listof jmps*/ 



spc; 



/* number of leading spaces */ 
short nlJMPSl-r size of jmp (gap) */ 
int xfJMPS]; /* loc of jmp (last elem before gap) */ 
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ch8r 


ofne. 


/* output file name *l 


Char 


namexiZj; 


/* seq names: getseqsO */ 


clur 


•prog; 


r prog name for err msgs */ 


Char 


seqxuj. 


/* seqs: getseqsO *l 


in* 

ini 


dmax^ 


/ pesi oiag: nw(i / 


ini 




/ final oiag / 


int 


ana; 


f set it dna: mainll / 


int 


endo^ps; 


/* set if penardng end gaps *l 


int 


gapx.gapy; 


/* total gaps m seqs*/ 


int 


lenujeni; 


/ seq lens / 


Int 


ngapx, ngapy; 


/* total size of gaps */ 


int 


smax; 


/* max score: nwfl */ 


int 


*xbm; 


/* bitmap for matching */ 


long 


offset; 


/* current offset in jmp file */ 


struct diag 


•dx; 


/* holds diagonals */ 


struct path 


PP121; 


f* holds path for seqs */ 


char 


•callocl), *malloc{L 'indexO, •strcpyO; 


char 


*getseqO, *g_cailQcO; 
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Table Kcont) 

/* Needleman Wunsch alignment program 

* usage: progs filel fileZ 

* where filel and fileZ are two dna or two protein sequences. 

* The sequences can be in upper- or lower-case an may contain ambiguity 

* Any lines beginning with V. * > ' or * < ' are ignored 

* Max file length is 65535 (limited by unsigned short x m the jmp struct) 

* A sequence with 1/3 or more of its elements AC6TU is assumed to be DNA 

* Output is in the file "afign-out" 
« 

* The program may aeate a tmp file in /tmp to hold info about traceback. 

* Original version developed under BSD 4.3 on a vax 8650 
•/ 

j^nclude "nw-h" 
#inciude "day-h" 



static Jbval[26] - { 

1,14,2,13AOAnA0,1U%15AOA5M8J,9A1O,0 

static jbvaIl261-{ 

1, 2|n < <rD^Tl)l(1 < <fNVA*)L 4, 8. 16, 32, 64, 

128, 256, OxFFFFFFF, 1< < 10, 1< < 1 1, 1< < 12, 1< < 13, 1< < 14, 

1< < 15, 1< < 16, 1< < 17, 1< < 18, 1< < 19, 1< <20, 1< <21J < <22, 

1 < <23, 1 < <24, 1 < <25|(1 < <fEVA'))|(1 < <rQVA')) 

}; 

mainlacav) nuin 
bit ac; 
char ""avD; 

{ 

prog-avfOl; 
ff(acl-3){ 

fprintf(stderr,"usage: %s filel file2ln", prog); 

fprintfistderr/whare filel and file2 are two dna or two protnn sequences.^'); 

23 
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fprintflstderr/The sequences can be in upper- or lower-case\n*'); 
f printf(stderr/Any lines beginning with or * < ' are ignoredln**); 
f printfistderr/Output is in the file ralign.autW); 
exitd); 

} 

namexIO] - av(1L' 

namexfl] - avfZl- 

seqxIO] - getseqinamexlOl &lenO); 

S8qx[1] - getseq(namex(1L &len1); 

xbm - (dna)? .dbval : jbval; 

endgaps - 0; /* 1 to penalize endgaps */ 

ofile - "align.out"; /* output file •/ 

nwO; /* fill in the matrix, get the possible jmps */ 

readjmpsO; 1* get the actual jmps */ 
printO; /* print stats, alignment */ 

cleamip{OI; T unlink any tmp files */ 



24 
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Table 1 (cont ) 

/* do the alignment return best score: mainO 

* dna: values in Fitch and Smith, PNAS, 80. 1382-1386, 1983 

* pro: PAM 250 values 

* When scores are equal, we prefer mismatches to any gap, prefer 

* a new gap to extending an ongoing gap, and prefer a gap in seqx 

* toagapinseq y. 



10 { 



char 


*P3t. 


/* seqs and ptrs */ 


int 


*ndely, •dely; 


riceep track of dely*/ 


int 


ndelx, delx; 


f* keep track of delx */ 


int 


•tmp; 


r for swapping rowO, rowl */ 


int 


mis; 


/* score for each type */ 


int 


insO, insi; 


/* insertion penalties*/ 


register 


id; 


/* diagonal index */ 


register 


ii; 


/* jmp index */ 


register 


•colO, •coll; 


/* score for curr, last row */ 


register 


XX. yy; 


/* index mto seqs*/ 



20 



dx •> (struct diag *)g_calloc("to get diagsMehO-i-ienI 1, sizeofistnict diag)); 

ndely « Qnt *}g_cailoc("to get ndely"« leni 1, sizeoflint)); 
25 dely - (int •)g_caI!oc("to get dely", leni + 1, sizeoflint)); 

colO - ant •)g_caIIoc("to get colO", leni + 1, sizeoflint)); 
coll - flnt *)g_caliocrto get coll", tenUI, sizeoflint)); 
insO-(dna)?OINSO:PINSO; 
insi- (dna)? DINS1:PINS1; 

30 

smax « -10000; 
if(endgap$){ 

for(colO[01-delyI01--iiKO,yy-1;yy <- len1;yy++){ 
colOIyy] - delylyy] - colOlyy-l] - insi; 
35 ndelylyyl - yy; 

) 

25 
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CQlOIO] « 0; r Waterman Bufl Math Biol 84 '/ 

} 

else 

for(yy - l;yy <- |enl;yy++) 
5 delylyyl • -insO; 

f* fill in match matrix 
•/ 

for (px - seqxfOl xx - 1; xx < - lenO; px+ +, xx+ +| { 
10 /* initialize first entry in col 

•/ 

if (endgaps) { 

if(xx.-l) 

coIKO] - delx - -(insO+insI); 

15 ebe 

colKOI-delx-colOIOlmsl; 
ndelx - xx; 

} 

'•>*•{ 

20 colUd-O; 

delx - -insO; 
ndelx -.0; 

} 
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Table 1(cont) 

forlpy - seqxIUyy - 1;yy < - lenl;py++.yy++| { 
mis *• colO[yy-1]; 
5 If(dna) 

mis + - (xbml*px-'A'l&xbmCpy.Xll? DMAT : DMIS; 

else 

mis+- Jayl^px'ATpy-'AT 

10 /* update penalty for del in x seq; 

* favor new del over ongong del 

* ignore MAXGAP if weighting endgaps 
•/ 

if (endgaps 1 1 ndetylyy] < MAXGAP) { 
15 if (colOIyy] - insO > - delylyy]} { 

detylyy] - colOIyy] • (insO-»-ins1); 
ndeiylyyl - 1; 

}else{ 

delyfyyl — inal; 

20 ndcly(yyl++; 

} 

}el$e{ 

if (colOlyyl • (insO+insI) > - delyiyyD { 
delylyyl - colO[yyl•(insQ+insl); 
25 ndelyIyyl-1; 

}eUe 

ndelylyyj++; 

} 

30 /* update penalty for dd in y seq; 

* f avof new del over ongong del 
•/ 

if (endgaps 1 1 ndelx < MAXGAP) { 
H(col1(yyH-insO>-deix){ 
35 debt - CQl1(yy-1] • (insO<»ins1); 

-1; 
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}else{ 

delx-- insl; 
ndelx++; 

} 

}els8{ 

If {coIKvy U - (insO-f insll > - delxl { 
deh - cal1(yy-1) -(insO+insD; 
ndah - 1; 

}eln 

ndeh ■•-<••; 

} 

/* pick the maxinum score; we're favoring 
* mis over any del and delx over dely 
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Table llcont) 

id - xx-yy + lenl ■ 1; 
if (mis > - delx &&niis > - delylyy]) 
5 colKyyl - mis; 

else if (delx >- delylyy]) { 

col1(yy] - delx; 

ij - dxDdUjmp; 

H(dxOdLip.n{0]&&(!dna 1 1 (ndeix >- MAXJMP 
10 &&XX > dxed].jp.x[iO-*-MX) 1 1 mis > dx[idl.score+OINS0» { 

dxDd].tjmp<<--»-; 
if (^^ij>- MAXJMP) { 
writejmps(id); 
ij - dxridlJjmp - 0; 

15 dxDd].offset - offset; 

offset sizeoflstructjmp) + sizeof (offset); 

} 

} 

dxrHiMp.nIq] - ndeh; 
20 ' dxDdl.jpJi(q] - xx; 

dxGdLscore - delx; 

> 

eise{ 

colKyy] - d8ly[yyj; 
25 ij-dx(id|.pp; 

if (dxDdl.jp.n[01 && (Idna 1 1 (ndelylyy] > - MAXJMP 

&& XX > dxpdLip.x[ij]+MX) 1 1 mis > dxrid].score<i-DINSO)) { 
dxtid).pp-»"i-; 
lf|*+ij>-MAXJMP){ 
30 writ^mps(id); 

ij - dx(id].ijmp - 0; 
dxOdLoffset - offset; 

offset slzeof(stnict Imp) Sfzeof(offset); 

} 

35 } 

dxDdLjp.n(q| - -edelytyyl* 
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dx(id),ip.x[ijl - xx; 
dx[id].score - deiylyy]; 

} 

if|xx--lenO&&Yy < lBn1){ 
5 riastcol 

•/ 

ifflendgaps) 

colKyyl- insO-»-tnsr(len1-yy); 
iflcolKyy] > smax){ 
10 snuut - coiUyyl; 

dmax - id; 

} 

} 

} 

15 if (endgaps && n < lenO) 

colllyy U - insO+insrOenO-xxl; 
if(col1(yy-1]>»nax){ 
smax - colUyy-11: 
dmax •■ id; 

20 } 

tmp - colO; colO • coil; coll - tmp; 

} 

(void)fr88|(ehar*)nd8ly); 
(void) freeUeliar *)dely); 
25 (voidlheellciiar'M 

(«oid)fre8((cliar«)col1h } 
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Table Kcont) 

♦ 

* printO - only routine visible outside this module 
5 • 

* static: 

* getmatO - trace back best path, count matches: printO 
pr_align() - print alignment of described in array pQ: printO 

* dumpblockO - dump a block of lines with numbers, stars: pr_align() 
10 * numsd - put out a number line: dumpblockO 

* putlineO - put out a line (name, (numl seq, {miml): dumpblockO 

* stars!) - -put a line of stars: dumpblockO 

* stripnameO strip any path and prefix from a seqname 
•I 

15 

#tnclude"nw.h" 
#defineSPC 3 

#def ine P_UNE 256 /* maximum output line *i 
20 #defineP_SPC 3 /'space between name ornum and seq*/ 

extern Jayl261t26L- 

int olen; /* set output line length *f 

FILE, 'fx; r output file*; 

printO print 
{ 

int lx,ly,rirstgap,tastgap; T overlap*/ 



25 



30 if ((fx - fopeniofile, "w")) — 0) f 

f printf(stderr,"%s: can't write %s\n", prog, ofOe); 
deanupll); 

} 

fprintflfx, * < first sequence: %s (length - %d)\n", namexfOL loiO); 
35 fpnntf(fx,"< second sequence: %s (length -%dHn",namexIUIen1); 

olen -60; 
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10 



15 



20 



ix - !enO; 
ly - ieni; 

firstgap - iastgap - 0; 

if (dmax < lenl • 1) { /• leading gap in x */ 
priOUpc - firstgap - lenl - dmax • 1; 
ly-ppI01.spc; 

} 

else if ((bnax > lenl - 1) { T leading gap in y */ 
pp[11.spc - firstgap - dmax • (lenl • 1); 
Ix- ppllUpc; 

} 

H (dmaxO < lenO • 1 ) { r traning gap in x */ 
Iastgap - Ien0>dmax0-1; 
]x — Iastgap; 

} 

else if (dmaxO > ienO ]){ /' trailing gap in y */ 
Iastgap - dmaxO-(ienG-1); 
ly — Iastgap; 

} 

getmatdx, ly, firstgap, Iastgap); 
pr^alignl); 



25 
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Tablet (cont) 

r 

* trace back the best path, count matches 
•/ 

5 static 

getmatdx, ly, firstgap, lastgap) getmat 
int Ix, ty; "core" (minus endgaps) */ 

int firstgap, lastgap; I* leading trailing overiap */ 

{ 

10 int nm,iOJ1.sizO,siz1; 

char outx[32]; 
double pet; 
register nO,n1; 
register char *pOi*p1; 

15 

r get total matches, score 
*/ 

10 - il - sizO - sizl - 0; 
pO - seqxIO] + pp{l].spc; 
20 pi - seqxd] -I- pp[OLspc; 

nO - pplUspc + 1; 
n1 - pp[0].spc + 1; 

nm - 0; 

25 while {*pO&&*p1}{ 

iflsizOK 

pl++; 
nl++; 
sbO-: 

30 } 

«ls8U(siz1|{ 
pO**; 
nO**; 
sol-; 

35 } 

•It8{ 
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iff(xbin(*pO>'A1&xbm(*p1-'A'D 

nin++; 
iflnOi--^ --pp[0].x(iO]) 

sizG - pp(OLn[iO++]; 
lf(nl + + --ppllWill) 

sizi - ppl11.n[il + +L- 

pO++; 
p1 ++; 

) 

} 

/* pet homoiogY: 

* if penaHzing endgaps. bass is the shorter seq 

* else, knock off overhangs and take shorter core 
•/ 

if(endgaps) 

ix-(lenO < Ien1)?len0:len1; 

else 

lx-Ox<ly)?lx:ly; 
pet - 100.*(doable)nml(doiihb)lx; 

fprintf|fx,"\n"); 

fprintf(fx« " < %d mdtch%s in an overlap of %d: %3i percent similarity\n", 
nm,(nm-- 1)7 : "esMx, pet); 
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Table Kcont) 

f printf If X, " < gaps in first sequence: %d", gapx); .^etmat 
ifigapx){ 

(void) sprintf(outx, " (%d %s%s)", 

ngapx, |dna)? "base":"re$idue". (ngapx - - II? "Vs"); 

fprintf(fx,"%s', oulx); 

fprintflfx, gaps in second sequence: %d", gapy); 
ingapy){ 

(void) sprintfioutx. " (%d %s%s)'. 

ngapy, (dnal? "base":"residue", (ngapy - - 1)? "":"s"|; 

fprintf(fx,-%s", outx); 

} 

if(dna) 

fprintflfx, 

"\n< score: %d (match - %d, nusmatch - %d, gap penalty - %d %d per baselln". 
smax, DMAT, DMiS, OINSO, 0INS1); 

ebe 

fprintf(fx, 

*^n < score: %d (Dayhoff PAM 250 matrix, gap penalty - %d -i- %d per residuenn", 
smax«PINS0,PINS1); 
if(endgaps) 

fprintf(fx, 

" < endgaps penalized, left endgap: %d %s%s, right endgap: %d %s%s\n". 
firstgap, (dna)? "base" : "re8idue^ (f irstgap - - 1 1? " : V, 
lastgap. (dna)? "base" : "residua", (lastgap - - 1)? "° : "s"); 

else 

fprintf(fx, " < endgaps not penalizedln"); 



static 


ran; 


r matches in core *- for checking */ 


static 


bnax; 


f * lengtiis of stripped file names *f 


static 


321; 


rjmp index for 8 path''/ 


static 


nc|2L* 


r number at start of cunfent line 


static 


m 


r current elem number - for gaping */ 
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Static siz|2]; 

static char 'psRl; /• ptr to current element */ 

static char *po(2]; /* ptr to next output char slot */ 

static char QUt[21[P JINE]; /* output line */ 

5 static char staiIP_UNEt /* set by starsO */ 

* print alignment of described in struct path ppD 
10 static 

PL*n(l pr_align 
{ 

int nn; /* char count *f 

int more; 
15 register i; 

for(i- 0,lmax-0;i < 2;i++){ 
nn - stripname(namex[i]); 
if (nn > Imax) 
20 Imax - nn; 

nc[0.1; 
ni[il-1; 
sizDl - iiDl - 0; 
25 pS[i] > seqxOl; 

poOl-outOL* } 
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for (nn » nm - 0, more <- 1; more; ) { ...pr align 

for fi - more - 0; i < 2; i+ +) { 

r 

* do we have more of this sequence? 
•/ 

ifCrpsOl) 

contlnue; 

morc++; 

if (ppIO-spcl { /* leading space *f 
•pora++-"; 
PPW^pc-; 

} 

elseif($iz[il){ /'in a gap*/ 
sizlil-; 

} 

else{ /* we're putting a seq element 

•/ 

•poIO - •psD); 
ifrislower(*psn])) 

•psta - toupperCpslill; 

po[i]++; 
psia++; 

r 

* are we at next gap for this seq? 
•/ 

ifiiHOI — ppIOJi(iiIilD{ 

r 

* we need to merge all gaps 

* at this location 

*' 
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slzIi)-ppIi].n[p++L 
while |ni(n--ppIO.x[ii[i]l) 

si2lil + -pplil.n[iini++]; 

} 

5 nji)++; 

} 

} 

if I-*- i-mi - - olen 1 1 Imore && nn) { 
dumpblocMI; 

10 fiir(i-0;i < 2;i++) 

po{i] - outlifc 

nn - 0; 

} 



15 } 



/• 

* dump a block of liius, including numbers, stars: pr_aiignO 
•/ 

20 static 

dunqiblocM) rfmffpjpjTC^ 
{ 

register 1; 

25 forO-0:i<2;l*+) 
•poDI- - Hr; 
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Tahle1(cont| 



^dumpblock 



(void) putc('\n\ fx); 
5 for|i-0;i < 2;i++){ 

ifroutlfl&&rout[i]!-"|| •{polil)|.")){ 
il(i"0) 

Rums(i); 
if(i--0&&*out[1]) 
10 starsO; 

putlineti); 

if(i--0&& ''outm) 

f printf (f X, star); 
Iffi.-l) 

15 nuins(i); 
} 

} 

} 

20 /• 

* put out a number line: dumpblockO 

static 

nums(ix) 

2 5 int ix; r index In outQ holding seq line *i 



{ 



char nlinelP^UNEt 
ragister Ij; 
register char 'pn, *px, *py; 



30 



for (pn - nline, i - 0; l< bnax^i^P^SPC; i^ +, pn-i* 
for 0 - ncful py - outfixl; 'py; py+ +, pn+ +) { 

ifCpy — -||*py..V) 
35 >-•'; 

»{ 
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lf(i%10--0||li--1&&ncl«H-1|){ 
j-fi<0)?i:i; 
forlpx - pn;j;i/- 10, px -) 
•px - i%10 ♦ 'OV 

if(i< 0) 

•px - 

} 

else 

•pn-"; 

} 

} 

•pn - 10'; 
nciix] - i; 

for (pn - nline; •pn; pn-» + 1 
(voM) putcCpn, fxl; 
(voidlputclV.fx); 

} 

r 

* put out a Gne (name, [numl seq. (numO: dunvblockO 
•/ 

static 

putline(ix) putlim 
int ix; { 
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Table 1 (cont ) 



int i; 
5 regbterchar *px; 

for (px - namexOxl i - G; *px && *px !- V; px-^-i', i-f 

(voidlputcrpx^fx); 
forti < lmax+P_SPC;i++) 
10 WoidlputcrMx); 

r these count from 1: 
* nQ is current element (from 1) 
ncQ is number at start of current Kne 

15. •/ 

forjpx - outfix); *px; px++) 

(void) putc(*px&0x7F, fx); 
(void)putc('\n\fx); 



20 



30 



K(rout[01 1 1 routfO] " -*&&*(po[Q» — ' 1 1 1 
routlll 1 1 rmitlll " • • && '{poIlD - - ' '» 



px-star; 

35 for(i-lmax<i-PJPC;i;i-) 
•px++ «••; 

41 
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r . 

* put a line of stars (seqs always in outiOl outfU): dumpblockO 
•/ 

25 Static 

starsO stars 
{ 

int i; 

register char *pO, *p1, cx, *px; 
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for (pO - outlOl pi - outdl; •pO && 'pi; pO+ +, pi + +) { 
if(isatpha(*pO)&&isalpha{*p1»{ 

if(xbmI*pD.'A'l&xbmfpl-'A'l){ 
cx - 
nm++; 

} 

eheif ildna&&Jay(*pO-'Arp1-'A'] > 0) 
cx - V; 

else 

) 

else 

a-"; 
*p3i++ - cx; 

} 

•px++ - V; 
•px - •«)•; 
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TahlelteontI 

* strip path or prefix from pn, return len: pr_aiign() 
5 static 

stripnameipn) stripname 
char *pii; /* file name (may be path) */ 

{ 

register char *px/py; 

10 

PY-0; 

forlpx -pn; "px^-px**) 
ifrpx--T) 

py - px + 1; 

15 if (py)* . 

(void) strcpy(pn, py); 
return(strten(pn)); 

} 

20 



25 



30 



35 
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T^Meljcofrt) 

/• 

* cleanup!) •* cleanup any tmp file 

* getseqO - read in seq, set dna, len, maxlen 
5 • g^callocO callocO with error checkin 

* readjmpsO - get the good jmps, from tmp file if necessary 

* writejmpsO - write a fWed array of jmps to a tmp file: nw() 
•/ 

^include "nwJi" 
10 include <sys/file.h> 



char •jname - "/tmp/homgXXXXXX"; /• tmp file for jmps */ 

FILE Ti; 

15 int . cleanupO; /• cleanup tmp fUe */ 

long IseekO; 

r 

* remove any tmp file if we blow 
20 •/ 

m 

int i; 

{ 

ififj) 

25 (void)unlink(inaffle); 
exitlil; 

) 

r 

30 'read, return ptr to seq. set dm, hn.in8]dw 

* sUp bnt starting with ';','<*. or '> ' 

* seq in upper or lower case 
•/ 

diar * 

35 getsei|(fae,leii) 

char 'fib; /* file name*/ 
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int 'len; r seq len V 

{ 

char linen 024L*pseq; 
register char *px, *py; 

Int natgc, tien; 

FILE Mp; 



K((fp-fopen(fils.VU — 0){ 

fprintf(stderr,"%s: can't read %s\n\ prog, file); 
exitd); 

} 

tIen - natgc - 0; 

while (fgets(lineJ024,fp»{ 

if CBne V 1 1 'line 1 1 'line '>') 

continae; 
for (px - line; 'px ! - In'; px+ + ) 

if (isupper(*px) 1 1 istower(*pxH 
tlen++; 

} 

if (Ipseq - inanoc((iii»igiieii)(tlen-^6») - - 0) { 

fprinti(stderr,'%s: mdlocO failed to get %d bytes for %sln''. prog, tien-i-6, ffle): 
exitdl; 

} 

pseq(01 - pseqd] - pseqtZ] - pseqlS] • 10'; 
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Table Hcnnti 



30 { 



char *|NC *calloc(l; 



.getseq 



py - pseq + 4; 
*len - tlen; 
5 rewind(fp}; 

while (fgets(lineJG24Jp)){ 

lfrKne-.V||'linB--'<'|p||ne.-'>'l 
continue; 

10 for (px - line; 'px !- V; px-f +1 { 

if(isupper(*px)) 

•py++-'px; 
elseif (islower(*px)) 

•py++ - toupperf*px); 
15 if(index|-ATGCU"/{py.1|)) 

natoc++; 

} 

> 

*py++ -10'; 
20 'py-W; 

(wU)fclos8(fp); 

dna - natgc > (tlei^); 

retnni(pseq'«-4); 

} 

25 

char * 

g_calloc(msg,nx,sz) g.calloc 
char *in$g; T program, calling routine */ 

int nx, sr, /* number and size of elements *1 



if ((px - calloc((unstgnedka/(uniigned)s2)) - - 0) { 
ifCmsgK 

3 5 fprintf(stderr, "%8: g_callocO failed %s (n-%d, S2-%d)ln-, prog, msg, nx. szfc 

46 



wo 00/68376 PCT/USOOA12370 
} 

} 

return(px); 

} 

5 

r 

* get final jmps from dxD or tmp file, set ppH reset dmax: mainO 
•/ 

readjmpsO readjmps 
10 { 

iat fd - -1; 

int sizJ0,i1; 
register i, xx; 

15 if(fi){ 

(void) fclose(f j); 

if ((fd - openQname. OJDONLY, 0)) < 0) { 

f printf(stderr, "%s: can't openi) %sln^ prog, jname); 
deanupdl; 

20 } 
} 

for (i - iO - i1 - 0, dmaxO - dmax, xx - lenO; ;(++){ 
while (1){ 

for (i - dx(dmaxl.ijmp; j > - 0 && dx{dmax].jp.x|j] > - xx; j-) 
25 ; 
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..readjinps 

if (j < 0 && dx[dmax).offset && fj) { 

(void) tseekd d, dx[dmax].Qffset 0): 
5 (void) read(f d, (char *)&dx[dmax].jp. sizeoKstruct jmp)); 

(void) read(fd, (char *)&dx(dmax].offset, sizeof(dx(dmax].offset)); 
dxIdmaxUjmp - MAXJMP-1; 

} 

else 

10 break; 
} 

H(i>-JMPS){ 

fprintflstderr, '%« too many gaps in alignmentin', prog); 
deamipdl; 

15 } 

if(i>-0){ 

slz - dx[dmaxLjpj)(jI; 
XX - dx[dmax].jp.xG]; 
dmax + - siz; 

20 if (siz < 0){ r gap in second seq*/ 

ppIllnOll - -siz; 
XX + - siz; 

/• id-xx-yy + lenl-l 
•/ 

25 ppll Will - XX • dmax + leni • 1; 

gapy++; 
ngapy— siz; 
r ignore MAXGAP when doing endgaps 

siz - ( siz < MAXGAP 1 1 endgaps)? -siz : MAXGAP; 

30 

} 

else if isiz > 0) { /* gap in first seq */ 
ppIOlnpG] - siz; 
pp[Olx(iO] - xx; 
35 gapx <!">-; 

ngapx-i--«z; 
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/* ignore MAXGAP when doing endgaps *l 

ta • ($iz < MAXGAP 1 1 endgaps)? siz : IUIAX6AP; 
iO++; 

} 

5 } 
else 

break; 

} 

10 f* revBTSB the order of pnps 

•/ 

for (j - 0, ffl-; j < iO; j+ +, iO-l { 

i ■ ppVLnQt pp(0|.n|j] - pp(0].fl|iO];pp[0]jinO] - i; 
t - PPtOLxIp: pplOlJiijI - ppIOLxpO]; pp(0Li(i01 - i; 

15 } 

forQ-0,i1~;j< i1;|++,i1-){ 

i - ppl1|.nOL-ppI11.nOI - pp[11.n(i1];pp{1]j)ni] - i: 
i - PP(1].x6L-pp(1].xlil - ppI1].itnil-ppI1lJi(i1] - i; 

} 

20 K(fd>-0) 

(«oid)close(fd); 

\m{ 

(void) unlinklinanie}; 
fj-0; 

25 offset -0; 

} } 
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* writfi a filled jmp struct offset of the prev one (if any): nwO 
5 •/ 

writejmpsdx) wrhejraps 
int ix; 

{ 

char ^mktempO; 

10 

if(!fi){ 

if(ml(temp(jname| < 0){ 

fprintflstderr, "%s: can't mktempO %s\n", prog, jname); 
cleanupd); 

15 } 

if |(f j - f openljname, "w")) - - 0) { 

fprintfistderr, "%s: can't write %s\n", prog, jname); 
exit(1); 

} 

20 } 

(void) fwriteKehar *)&dxrix].jp, sizeoKstnictjmp), 1, fj); 
(void) fwrite((diar *)&dxGxl.offset sizeof(dx[ix].offset), 1. f|); 

} 

25 



30 
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Table 2 

PRO XXXXXXXXXXXXXXX (Length - 15 amino acids) 

Comparison Protein XXXXXYYYYYYY (Length - 12 amino adds) 

% amino acid sequence Identity - 

(the number off identicaiiy matching amino acid residues between the two polypeptide sequences as determined by AUGN-2) 
dhrided by (the total number of amino acid residues of the PRO polypeptide) - 

5 divided by 15 -33.3% 
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Table 3 



PRO 



XXXXXXXXXX 



(Length - 10 amino acids) 
(Length - 15 annino acids) 



Comparison Protein 



XXXXXYYYYYYZZYZ 



5 



% amino acid sequence identity - 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined by AUGN-2) 
divided by (the total number of amino acid residues of the PRO polypeptide) - 

10 

5dividedby10-50% 
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PRO-ONA NNNNNNNNNNNNNN (Length - 14 nucleotides) 

Comparison DNA NNNNNNLLLLLLLLLL (Length - 16 nucleotides) 

% nucleic acid sequence identity - 

(the number of identically matching nucleotides between the two nucleic acid sequences as detennined by ALIGN-2) divided 
by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) - 

6 divided by 14 -4ZS% 
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TabteS 

PRO DNA NNNNNNNNNNNN (Length - 1 2 nucleotides) 

Comparison DMA NNNNillW (Length - 9 nucleotides) 

% nucleic acid sequence identity - 



(the number of identically matching nuclratides between the two nucleic add sequences as detemuned by ALIGN-2) divided 
by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) - 

4dhfidedby12-33.3% 
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11. Compositions and Methods of the tnvention 

A. FulMenoth CHEPO PoivDeotide 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides refened 
to in the present appfication as CHEPO. In particular^ ONA encoding a CHEPO polypeptide has been identified and isolated. 
5 as disclosed in further detail in the Examples below. 



B. CHEPO Variants 

In addition to the f ulMength native sequence CHEPO polypeptides described herein, it is contemplated that 
CHEPO variants can be prepared. CHEPO variants can be prepared by introducing appropriate nucleotide changes into the 

1 0 CHEPO ONA, and/or by synthesis of the desired CHEPO polypeptide. Those skilled in the art wiD appreciate that amino 
acid changes may alter post transiationai processes of the CHEPO, such as changing the number or position of 
giycosylation sites or altering the membrane anchoring characteristics. 

Variations in the native full-length sequence CHEPO or in various domains of the CHEPO described herein, can 
be made, for example, using any of the techniques and guidelines for conservathre and non-conservative mutations set 

15 forth, for instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, deletion or insertion of one or more 
codons encoding the CHEPO that results in a change in the amino acid sequence of the CHEPO as compared with the native 
sequence CHEPO. Optionally the variation is by substitution of at least one amino acid with any other amino acid in one 
or more of the domains of the CHEPO. Guidance in determining which amino acid residue may be mserted, substituted or 
deleted without adversely affecting the desired acthrity may be found by comparing the sequence of the CHEPO with that 

20 of homologous known protein molecules and mniimizing the number of anuno acid sequence changes made in regions of 
high homology. Amino acid substitutions can be the result of replacing one amino acid with another amino acid having 
similar structural and/or chemical properties, such as the replacement of a leucine with a serine, Le., conservative amino 
acid replacements. Insertions or deletions may optionally be in tiie range of about 1 to 5 amino acids. The variation 
allowed may be determined by systematically making insertions, deletions or substitutions of amino acids in tiie sequence 

2 5 and testing tiie resulting variants for activity exhibited by tiie full-iengtii or mature native sequence. 

CHEPO polypeptide fragments are provided herein. Such fragments may be truncated at the N-terminus or C- 
terminus, or may lack internal residues, for example, when compared with a full length native protein. Certain fragments 
lack andno acid resigns that are not essential for a desired biological activity of tiie CHEPO polypeptide. 

CHEPO fragments may be prepared by any of a number of conventional techniques. Desired peptide fragments 

30 may be chemically syntfiesizei An ahemative approach invobes generafuig CHEPO fragments by enzymatic digestion, 
B.g., by treating tiie proteui wttii an enzyme known to cleave proteins at sites defined by particular anuno add residues, 
or by cBgesting the ONA witii suitable restriction enzymes and isolating tiie desired fragment Yet anotiier suitable 
technique invohres isolating and amplifying a ONA fragment encoding a desired polypeptide fragnmt by polymerase chain 
reaction PCR|. Oligonucleotides that define the desired teimnn of tin ONA fragment are emptoyed at the 5' and 3' primers 

35 in tiie PCfl Pireferably, CHEPO polypeptide fragments share at least one biotogical and/or nununofogical activity widi tiie 
native CHEPO polypeptides shown m Figure 3 (SEQ ID N0S:2 and S|. 
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In particular embodiments, conservative substitutions of interest are shown in Table 6 under the heading of 
preferred substitutions. If such substitutions result in a change in biological activity, then more substantial changes, 
denominated exemplary substitutions in Table 6, or as further described below in reference to amino acid classes, are 
introduced and the products screened 

5 

Table 6 



OriQlnal Exemplary Preferred 

Residue Substitutions Substitutions 

Ala (A) vatteu;ile val 

10 Arg(R) lys;gin;asn lys 

Asn(N) gin; his; lys; arg gtn 

Asp(D) glu glu 
Cys(C) ser ser 

GlnlQ) asn asn 

15 Glu(E) asp asp 

6ly(G) pro;aia ala 

His(H) asn; gin; lys; arg arg 

fie (I) leu; val; met; ala; phe; 

norleucine iau 

20 Laud) norieiicine;ite;val; 

met; ala; phe He 
Lys(K) arg; gin; asn arg 

MetIM) leu;phe;ae leu 
Phe(F) leu; val; lie; ala; tyr leu 

25 Pro(P) ab ala 

SerlS) thr tlir 

ThrfD ser ser 

TrplWI tyr;phe tyr 
Tyr(YI trp; phe; thr; ser phe 

30 Vd(V) ae; tea; met; phe; 

ala; norleucine leu 



Substantial modifications in function or invnunological idcaittty of the CHEPO potypeptide are accomplished by 
sdectbig substitutions that differ significamly in their effect on maintainbig (a) the structure of the polypeptide backbone 
35 in the area of the substitution, for example, as a sheet or helical conformation, (b) tiie charge or hydrophobidty of die 
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molecule at the target site, or (c) the bulk of the side chain. Naturally occurring residues are divided into groups based on 
common side-chain properties: 

(1) hydrophobic: norleucine, met, aia, val, teu, ile; 

(2) neutral hydrophilic: cys, ser, thr; 
5 (3) acidic: asp, glu; 

(41 basic: asn, gin, his, lys, arg; 

(5) residues that influence cham orientation: giy, pro; and 

(6) aromatic: trp, tyr, phe. 

1 0 Non-conservative substitutions wiO entail exchanging a member of one of these classes for another class. Such 

substituted residues also may be introduced into the conservatwe substitution sites or, more preferably, mto the remaining 
(non-conserved) sites. 

The variations can be made using methods known in the art such as oligonucleotide-mediated (site-directed) 
mutagenesis, alanine scanning, and PCR mutagenesis. Sitenlirected mut^enesis [Carter et al., Nucl. Acids Res.. 13:4331 
15 (1986); Zoller et al., Nucl. Acids Bes.. 12:8487 {1987)1 cassette mutagenesis [Wells et al., GgQi, 34:315 (1985)], 
restriction selection mutagenesis [Wels et al., Phflos. Trans. R. Soc. London SerA. 317:415 (1986)] or other known 
techniques can be performed on the cloned DMA to produce the CHEPO variant DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a contiguous 
sequence. Among the preferred scanning ammo acids are relatively small, neutral amino adds. Such amino acids mclude 
2 0 alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid among this group because it 
eliminates the side-chain beyond the beta-carbon and is less likely to alter the main-chain conformation of the variant 
(Cunningham and Weils, Science, 244: 108M08S (1989)]. Alanine is also typically preferred because it is the most 
' common amino acid. Further, it is frequently found in both buried and exposed positions [Creighton, The Proteins. (W.H. 
Freeman & Co., N.Y.); Chothia, J. MoL Bid.. 150:1 (1976)1. If alanine substitution does not yield adequate amounts of 

2 5 variant, an isoteric amino acid can be used. 

C. Modifications of CHEPQ 
Covaknt modifications of CHEPO are mcluded within the scope of this invention. One type of covalent 
modification mchides reacting targeted amino add residues of a CHEPO polypeptide with an organic derivatizing agent that 
30 Is capable of reacting with selected side diains or the N- or C- temdnal residues of the CHEPO. Derivatization with 
bifonctionai agents is useful for hstance, for crossEnklng CHEPO to a water-insoluble support matrix or surface for use 
in the method for purifying anti4:HEP0 antibodies, and vicewsa. Commonly used crossEnking agenu mdude, 1,1 - 
bis(diazoacetyD-2'phenytethane. ghitaraldehydo, Mliydroxysucdnindda esters, for examplG^ esters with 4-azidosalicyBc add, 
hmnobrfunctional imidoesters, imduding disucdnimidyl esters sudi as 3,T-dithiobis(succmimidyipropionate), bif mutional 

3 5 maleinudes sudi as bis4-maleimido-1,8Hict8ne and agents sudi as methyl-3-[(p*azidophenyl)dithio]propioimidate. 
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Other modifications incfude deamidation of glutaminyl and asparaglnyl residues to the corresponding glutamyl 
and aspartyl residues, respectively, hydroiylatlon of proline and lysine, phosphorylation of hydroxy! groups of seryl or 
threonyl residues, methylation of the -amino groups of lysine, arginine. and histldine side chains (T.E. Creighton. Proteins: 
Structure and Molecular Properties. W.H. Freeman & Co.. San Francisco, pp. 79-86 (1983)1 acetylation of the N-terminal 
5 amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the CHEPO polypeptide included within the scope of this invention 
comprises altering the nath/e glycosylation pattern of the polypeptide. "Altering the native glycosylation pattern" is 
intended for purposes herein to mean deletmg one or more carbohydrate moieties found in native sequence CHEPO {either 
by removing the underlying glycosylation she or by deleting the glycosylation by chemicd andfor enzymatic means), and/or 
10 adding one or more glycosylation sites that are not present in the native sequence CHEPO. In addition, the phrase includes 
qualitative changes in the glycosylation of the native proteins, involving a change in the nature and proportions of the 
various carbohydrate moieties present. 

Addition of glycosylation sites to the CHEPO polypeptide may be accomplished by altering the amino acid 
sequence. The alteration may be made, for example, by the addition of. or substitution by. one or more serine or threonine 
15 residues to the native sequence CHEPO (for 0-linked glycosylation sites). The CHEPO amino acid sequence may optionally 
be altered through changes at the DNA level, particularly by mutating the DMA encoding the CHEPO polypeptide at 
presdected bases such that codons are generated that will translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the CHEPO polypeptide is by chemical or 
enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art e.g., in WO 87f05330 
20 published 1 1 September 1987, and in Apiln and Wriston. CRC Crit. Rev. Binchem.. pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the CHEPO polypeptide may be accomplished chemically or 
enzymatically or by mutational substitution of codons encoding for amino acid residues that serve as targets for 
glycosylation. Chemical deglycosylation techniques are known in the art and described, for instance, by Hakimuddin. et 
aL Arch. Biochem. Bionhvs.. 259:52 (1987) and bv Edoe et aL. Anal Biochem.. 1 18:131 (1981). Enzymatic cleavage of 
25 cartiohydrate moieties on polypeptides can be achieved by the use of a variety of endo- and exo-glycosidases as desaibed 
by Thotakura et aU Meth. Enzvmol. 138:350 (1987). 

Another type of covalent modification of CHEPO comprises linkmg the CHEPO polypeptide to one of a variety 
of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes. in the manner 
set forth in UA Patent Nos. 4.640.835; 4.498,689; 4.301,144; 4.670.41 7; 4.791.192 or 4.179.337. 
30 The CHEPO of the present imrentkm may also ba modified in a way to form a chimeric molecule comprising 

. CHEPO fused to another, heterologous polypeptide or andno acid sequence. 

In one embodimentr such a cUmeric molecule comprises a fusion of the CHEPO with a tag polypeptide which 
provittes an epitope to whUi an anti-tag antiipdy can selectively Und. The epitope tag is generally placed at tfie amino- 
or caifaoxyl- temimus of Mia CHEPO, The presence of such epitope-tagged forms of tf» CHEPO can be detected usmg an 
35 antaiodyagamstthotagpoiypeptide. Alsi^ provision of the epitope tag enables tin CHEPO to be rea^ 

purifieation usmg an anthtag antOiody or another type of affniity matrix tint bmds to tiie epitope tag. Various tag 
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polypeptides and their respective antibodies are well known in the art Examples include poly-histidine (poly-his) or poly- 
histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide and its antibody 12CA5 [Field et al., MoL CelLBioL 8:2159- 
2165 (1988)1; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto [Evan et al.. Molecular and 
CelMar Biology, 5:3610-3616 (1985)]; and the Herpes Simplex virus glycoprotein 0 (gD) tag and its antibody [Paborsky 
5 et dU Protein Engineering, 3(61:547-553 (1990)]. Other tag polypeptides tnchtde the Flag-peptida [Hopp et aL, 
BioTechnology, 6:1204-1210 (1988)1; the ia3 epitope peptide (Martin et aU Sctnt^e, 25g:192-194 (1992)]; an -tubulin 
epitope peptide (Skinner et al., J.Biol. Chem.. 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide tag Ibiti- 
Freyermuth et aU Proc. Natl. Acad. Sci. USA. fi7:6393-6397 (1990)1. 

In an i^emative embodiment, the chimeric molecule may comprise a fusion of the CHEPO with an immunoglobulin 

10 or a particular region of an immunoglobulin. For a bhralent form of the chimeric molecule (also referred to as an 
immunoadhesin ), such a fusion could be to the Fc region of an IgG molecule. The Ig fusions preferably include the 
substitution of a soluble (uansmembrane domain deleted or inacthrated) form of a CHEPO polypeptide in place of at least 
one variable region within an Ig molecule. In a particulariy preferred embodiment, the anmunoglobulin fusion inchides the 
hinge, CH2 and CH3, or the hinge, CHI, CH2 and CH3 regions of an IgGI molecule. For the production of immuno^obulin 

15 fusions see also US Patent No. 5,428,130 issued June 27, 1995. 

0. Preparation of CHEPO 

The description bdow relates primarily to production of CHEPO by odturing celb transformed or transfected with 
a vector containing CHEPO nucleic acid. It is, of course, contemplated that altemathre methods, which are we! known 

20 in the art, may be employed to prepare CHEPO. For instance, the CHEPO sequence, or portions thereof, may be pnubced 
by direct peptide synthesis using solid-phase techniques [see, e.g., Stewart et al., Sofid-Phase Pentide Synthesis. WJt 
Freeman Co., San Frandsco, CA (1969); Merrifield, J. Am. Chem. Soc. 85:2149-2154 (1963)]. h vitro protein synthesis 
may be performed us'mg manual techniques or by automation. Automated synthesis may be accomplished, for instance, 
using an Applied Biosystems Peptide Synthesizer (Foster City, CA) using manufacturer's instruc^^^^^^ Various portions of 

25 the CHEPO may be chemically synthesized separately and combined using chemfa;al m enzymatic methods to produce the 
fuD-tength CHEPO. 

1. isolation of DNA Encodino CHEPO 
DMA encoding CHEPO may be obtained from a cONA Bhrary prepared from tissue believed to possess the CHEPO 
30 mRNAamltoexpressitatadetectableioveL Accordingly, human CHEPO DMA can be convmiently obtained from a cD^ 
library prepared from human tissue, such ss described in the Examples. The CHEPO-encoding gene may also be obtained 
from a genomic library or by known synthetic procedures (e.g., automated mideic ecid synthesis). 

Libraries can be screened with probes (such as antBnifies to the CHEPO or orpnudeotides of at hast about 20* 
80 bases) designed to idoitify the gene of interest or the proton encoded by it Screenmg the cONA or genomh: Gbrary 
35 with the selected probe may be comlucted using standani procedures, such as described in Ssmbrook et al., Motecular 
(;|p!iiroi;AMbjWoryMw»Wl (New York Cold Spriiq Harbor Laboratory Press, 1 989). An ahematwe means to isolate 
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the gene encoding CHEPO is to use PCR methodology [Sambrook et sunra: Dieffenbach et al.. PCR Primen A Lahoratorv 
Manual (Cold Spring Harbor Laboratory Press, 1995)]. 

The Examples below describe techniques for screening a cDNA Gbrary. The oligonucleotide sequences selected 
as probes should be of sufficient length and sufficiently unambiguous that false posithres are minimized. The 
5 oGgonudeottde is preferably labeled such that it can be detected upon hybridization to DNA in the library being screened. 
Methods of labeling are weN known in the art. and include the use of radiolabels like ^P-labeled ATP, biotinylation or 
enzyma labefing. Hybridization conditions, inchiding moderate stringency and high stringency, are provided in Sambrook 
et aL sunra. 

Sequences identified in such fibrary screening methods can be compared and alined to other known sequences 
1 0 deposited arid available in public databases such as GenBank or other private sequence databases. Sequence identity (at 
either the amino add or nucteotide level) within defined regions of the molecule or across the fulMength sequence can be 
determined using methods known in the art and as described herein. 

Nudeic add having protein coding sequence may be obtained by screening selected cDNA or genomic libraries 
using the deduced amino acid sequence disclosed herem for the fvst time, and, if necessary, using conventional primer 
X 5 extension procedures as described in Sambrook et a!., sunra. to detect precursors and processing intermediates of mRNA 
that may not have been reverse-transcribed into cONA. 

2. Selection and Transfomiation of Host Cells 
Host cells are transfected or uansformed with expression or cloning vectors described herein for CHEPO 

20 production and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting 
transf onmants, or amplifying the genes encoding the desired sequences. The culture conditions, sudi as media, 
tonperature, pH and the like, can be selected by the skilled artisan without undue experimentation. In general principles, 
protocols, and practical techniques for maximizing the productivity of cell cultures can be found in Mammalian Cell 
Biotechnoloov: a Practical Annroach. M. Butler, ed. (IRL Press, 1991) and Sambrook et al., suora . 

25 . Methods of eukaryotic cell transf ection and prokaryotic cell transformation are known to the ordinarily skilled 

artisan, for example, CaCl2, CaP04, liposome-mediated and electroporation. Depending on the host cell used, 
transformation is performed using standard techniques appropriate to such ceHs. The caUum treatment employing caldum 
ddorido, as described in Sambrook et aU suera. or electroporation is generally used for prokaryotes. Infection with 
Apobsctm&m Wm^dm is used for transformation of certahi plant cells, as (tescribed by Shaw et aU Gene, 22:31 5 

30 (1983) and WO 89105859 pdilished 29 JiHie1S89. For mammalim ceBsvrithoutsudicen walls, the caldum pta^ 
precvitation method of Graham ami van dor Eb, Virohmv, §2^(58457 (1978) can be employel General aspects of 
mammatan ceB host system transfections have been descrMm Transformations in^ yeast 

are typically carried out accenfing to the method of Van Sofingen et al^ J. Bact. 130:946 (1977) and Hsiao et aL, Proc 
WatL Acad. Sd. (USA1. 2g:382g (1878). However, other methods for introdudng DNA into cells, such as liy nuclear 

3 5 microaijection, dectroporation, bacterial protoplast fudon with mtact cells, or polycationSi polybrm^ polyomithine. 
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may also be used. For various techniques for transforming mammalian cells, see Keown et al., Methods In EnzvmnlpflYr 
185:527-537 (19901 and Mansour et aL, Nature. ^:348-352 (1988). 

Suitable host ceils for cloning or expressing the ONA in the vectors herein include prokaryote, yeast, or higher 
eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as Gram-negative or Gram-positive 
5 organisms, for example, Enterobacteriaceae such as £ col Various £ cQSitim% are publicly available, such as £ coli 
K12 strain MM294 lATCC 31,446); £ coStMn (ATCC 31,537); £ cof strain W31 10 (ATCC 27,325) and K5 772 (ATCC 
53,635). Other suitable prokaryotic host cells inchide Enterobacteriaceae such as Eschendda, e.g., £ coS, Bitemtaeter, 
Emma, X/absieb, hatm, Salamaf/a, e.g., SahmmeSa tyiMimium, Sanatia, a^., Smatia mrcascans, and Sh^^, 
as well as 0ffc/¥such as A subm and A la^iemhmb te.^, A Sch&dfonm 41P disclosed in DD 266,710 published 12 

10 AprO m^Psmhmnasm^i^^P.aan^'msa, zuiSmtamyc^, These examples are Blustrative rather than limiting. 
Strain W31 10 is one particularly prefened host or parent host because it is a common host strain for recombinant DNA 
product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic enzymes. For example, strain 
W31 10 may be modified to effect a genetic mutation in the genes encoding proteins endogenous to the host widi examples 
of such hosts inctudmg £ coSf W31 10 strain 1 A2, which has the complete genotype UmA ; £ coSm\ 1 0 strain 9E4, which 

15 has the complete genotype tonA ptr3: £ coSm\ 10 strain 27C7 (ATCC 55,244), which has the complete genotype Avz4 
ptr3phoA £15 fargf lacJfSS degP ompTkarf; £ eoffmm strain 3706, which has the complete genotype tonA ptr3 
phoA E1S (argf lac}!69 degP ompT rbs7Mkati\ £ fo/f'W3110 strain 40B4, which is strain 3706 with a non- 
kanamycin resistant degP deletion mutation; and an £ r^/r strain having mutant periplasmic protease disclosed in U.S. 
Patent No. 4,946,783 issued 7 August 1990. Alternatively, m vitro methods of cloning, PCR or other nucleic add 

20 polymerase reactions, are suitable. 

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning or 
expression hosts for CHEPO-encoding vectors. Saccharomyces cerevisiae is a commonly used lower eukaryotic host 
microorganism. Others include Schizosaccharomyces pomba (Beach and Nurse, Nature. 290 : 140 119811; EP 1 39,383 
published 2 May 1985); Kb/ymvmycas hosts (U.S. Patent No. 4,943,529; Fleer eta/., Bio/Technolooy. 9: 968-975 (1991)) 

25 such as, ft^., a: lactis IMW98-8C, CBS683, CBS4574; Louvencourt et ai., J. Bacteriol.. 737 11983]), AT. frag^s (ATCC 
12,4241 K. btiga/kus (ATCC 16,045), K. wickeramSmZZ 24,178), K. W8fr/7(ATCC 56,500), K. dmophitarum (ATCC 
36,906; Van den Beig era£, Bio/Technoloov. g: 135 (1890)), K. thmmtobm, and K. imxbnus; yanowia (EP 402,226); 
Phidapastmb (EP 183,070; Staekrishna «f a£, J. Basic Microbiol.. 28: 265-278 [1988D; Camm T/khoibnnam&a{EP 
2U^Y.Kauiospommsso{ZasBataf., Proc Natl Acad. Sd. USA. J^. S259-5263 [WmSehwairnhmyi^satAi as 

3 0 SdtwatBmnfeos mOkntaa (EP 394,538 published 31 October 1 990); and filamantous fungi sudi ai, ff^ Keiffospota, 
PmOmi, Tei/poebiaBn (WO 91100357 published 10 Jamnry 1991), mtdAsfieigms hots sudi as A nSftAm (Balam» 
stal.. B}^^flim\vift.i^(immtf m 284-289 (1983]; Tilbum 0f aI, Sm 2&: 205-221 (1983]; Yelton etai.. 
PrW, Mr MSA, SI: 1470-1474 [1984]) ami A. tdgar (KeBy aod Hynes, EMBOJ.. 4: 475479 [1985D. 

Methylotropic yeasts are sutahle herdn and induda, but ara not Bmitad to, yeast capable of grtnivth on methanol seiactad 

35 fnmi the genera condsting of /bnsan^ ASst 
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of specific spedes that are exemplary of this class of yeasts may be found In C. Anthony, The Biochflmistry of 
MethvlotroDhs. 26911982). 

Stfltable host ceOs for the expression of glycosylated CHEPO are derived from multicellular organisms. Examples 
of invertebrate cells include insect ceUs such as Drosophila S2 and Spodoptera Sf 9, as well as plant cells. Examples of 
5 useful mammafian host cell ines mclude Chniese hamster ovary (CHO) and COS cells. More specific examples include 
monkey kidney CVl line transfomted by SV40 (C0S7, ATCC CRL 1651); human embryonic kidney line (293 or 293 ceHs 
subcloned for growth in suspension culture, Graham et aL, J.GenVffoL ^:59 (1977)); Chinese hamster ovary cells/-DHFR 
(CHO, Uriaub and Chasin, Proc. Natl. Acad, Sd. USA 77:4216 (1980)); mouse Sertoli ceOs (TM4, Mather, BioL Reprod.. 
23:243-251 (1980)); human lung cells (W138, ATCC CCL 75); human Ihrer cells (Hep 02, HB 8065); and mouse mammary 
10 tumor (MMT 060562, ATCC CCL51). The selectian of the appropriate host ceU is deemed to be within the skill in the art 

3. Selection and Use of a Repficable Vector 
The nucleic acid {B,g,, cDNA or genomic DNA) encoding CHEPO may be inserted into a replicabie vector for 
cloning (amplification of the ONA) or for expression. Various vectors are publicly available. The vector may, for example, 
15 be in the form of a plasmid. cosmid, viral particle, or phage. The appropriate nucleic acid sequence may be inserted into 
the vector by a variety of procedures, in general, DNA is inserted into an appropriate restriction endonuclease site(s) using 
techniques known in the art. Vector components generally include, but are not limited to, one or more of a signal sequence, 
an origin of replication, one or more marker genes, an enhancer element, a promoter, and a transcription termination 
sequence. Construction of suitable vectors containing one or mora of these componenU employs standard ligation 
20 techniques which are known to the skilled artisan. 

The CHEPO may be produced recombinantly not only directly, but also as a fusion polypeptide with a 
heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site at the N- 
terminus of the matue protein or polyp^de. In geteral, the signal sequence may be a component of the vector, or it may 
be a part of the CHEPO-encoding DNA that is inserted into the vector. The signal sequence may be a prokaryotic signal 

2 5 sequence selected, for example, from the group of the alkaline phosphatase, peniciUinase, Ipp, or heat-stable enterotoxin 

II leaders. For yeast secretion the signal sequence may be, the yeast invertase leader, alpha factor leader (indudmg 
Saeehnunyces and Kiiymomyees -factor leaders, the latter described in ILS. Patent No. 5,010,182), or acid 
phosphatase leader, the C. aOricm ghKoamylase leader (EP 362,1 79 published 4 Aprfl 1 890), or the signal descraied in 
WO 90113646 published 15 November 198a In mammaSan cell axpression. mammalian sipal sevieacesmay be used 
30 to direct secretion of the proteia such as signal sequences from secreted polypeptides of the same or related spedes. as 
well as viral secretory leaders. 

Both expression and cbnmg vecton contam a nucleic add sequence that emiUes the vector to repGcate in one 
or more selected host cells. Sudi sequences are well known for a variety of bacteria, yeast, and viruses. The origin of 
replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2 plasmid origni is suitable for yeast 

3 5 and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for doning vectore m mammaGan cells. 
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Expression and cloning vectors will typicallY contain a selection gene, also termed a selectable marker. Typical 
selection genes encode proteins that {a) confer resistance to antibiotics or other toxins, e.g., ampicillin, neomycin, 
methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or supply critical nutrients not available from 
complex media, e.g., the gene encoding D aianine racemase foTBaallL 
5 An example of suitable selectable markers for mammalian cells are those that enable the identification of cells 

competent to take up the CHEPO-encoding nucleic add, such as DHFR or thymidine kinase. An appropriate host cell when 
wild-type DHFR is employed is the CHO cell line deficient in DHFR activity, prepared and propagated as descnbed by Urlaub 
8t al., Proc. Natl. Acad. Set. USA. 77:421 6 (1 980). A suitable selection gene for use in yeast is the trp] gene present in 
the yeast plasmid YHp7 [Stinchcomb et al.. Nature, 282:39 (1979); Kingsman et al., ggQe, 7:141 (1979); Tschemper et 
10 9L, Gens, IQ:1S7 (1980)1. The trp} gene provides a selection marker for a mutant strain of yeast lacking the ability to 
grow in tryptophan, for example, ATCC No. 44076 or PEP4-1 (Jones, Genetics. 85:12 (1977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the CHEPO-encoding nucleic add 
sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells are well known. Promoters 
suitable for use with prokaryotic hosts indude the -lactamase and lactose promoter systems [Chang et al., Nature, 
15 275:615 (1978); Goeddel et al., Nature. 231:544 (1979)1, alkaline phosphatase, a tryptophan (trp) promoter system 
[Goeddel, Nudeic Acids Res.. 8:4057 (1980); EP 36,7761 and hybrid promoters such as the tec promoter [deBoer et al., 
Proc Natl. Acad. ScL USA. 80:21-25 (1983)]. Promoters for use in bacterial systrans also will contain a Shine-Da^amo 
(S.D.) sequence operably linked to the ONA encoding CHEPO. 

Examples of suitable promoting sequences for use with yeast hosts indude the promoters for Siihospboglycerate 

2 0 kinase [Hitzeman et al., J. Biol. Chem.. 255:2073 (1 980)] or other glycolytic enzymes [Hess et aL, J. Adv.EnzvmflHeo.. 

7:149 (1968); Hdland, Biochemistry. 17:4900 (1978)1 such as enolase, glyceraldehyde-3-phosphate dehydrogenase, 
hexokinase, pyruvate decarboxylase, phosphofructokinaser glucose-6-phosphate isomerase, S^thosphoglycerate mutase, 
pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and giucokinase. 

Other yeast promoters, whid) are induc&le promoters having the additional advantage of transcription controOed 
25 by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, add phosphatase, 
degradative enzymes assodated with nitrogen metabolism, metailothionein, glyceraldehyde-3-phosphate dehydrogenase, 
and enzymes responsble for maltose and galactose utilization. Suitable vectors and promoters for use in yeast expression 
are further described in EP 73,657. 

CHEPO transcription from vectors in mammaRan host cells is controlled, for example, by promoters obtained from 

3 0 the oemHM of viruses sudi as pdyoma ma, fowlpox virus (UK Z21 1,504 pubOshed 5 July 1989), adenovirus (sudi as 

Adenovirus 2), bovine papiBoma viruSr avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and Sonmn Virus 
40 (SV40L from Interologous Riammalian priBmoters, e^^, the actin promoter or an immunoglobulin promoter, and from 
heat-shock promoters, provided such promoters are compatible with the host cell systoans. 

Transcription of a DMA encoding the CHEPO by higher eukaryotes may be tacreased by inserting an enhancer 
3 5 sequence into the vector. Enhancers are cis-actmg elemms of DMA, usually about from 10 to 300 bp, that act on a 
promoter to increase iu transcription. Many enhancer sequences are now knovvn from mammaEan genes (glo^ 
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albumin, -fetoprotein, and insulin). Typically, however, one will use an enhancer from a eukaryotic ceil virus. Examples 
include the SV40 enhancer on the late side of the replication origin (bp 100-270). the cytomegalovirus early promoter 
enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers. The enhancer may 
be spliced into the vector at a position 5' or 3* to the CHEPO coding sequence, hut is preferably located at a site 5' from 
5 the promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated cells 
from other multicellular organisms) will also contain sequences necessary for the termination of transcription and for 
stabilizing the mRNA. Such sequences are commonly available from the 5' and, occasionally 3'. untranslated regions of 
eukaryotic or viral DMAs or cDNAs. These regions contain nucleotide segmenu transcribed as polyadenylated fragments 
10 in the untranslated portion of the mRNA encoding CHEPO. 

Still other methods, vectors, and host cells suitable for adaptation to the synthesis of CHEPO in recombinant 
vertebrate cell culture are described in Gething et al„ Mature. 293:620-625 (1981); Mantei et al.. Nature. 281:4046 
(1979); EP117,060; and EP117.058. 

^5 4. Oetectino Gang Amnlification/ExnrBMiQn 

Gene amplification andfor expression may be measured in a sample directly, for example, by conventional 

Southern blotting, Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. NatL Acad. Sci. USA. 

77:5201-5205 (1980)}. dot blotting (DNA analysis), or wsyto hybridization, using an appropriately labeled probe, based 

on rtie sequences proviited herein. Altemathrely, antibodies may be employed that can recognize specific duplexes inchiding 
20 DNA duplexes. RNA duplexes, and DNA-RNA hybrid duplexes or ONA-protein duplexes. The antiboAs in turn may be 

labeled and tiie assay may be carried out where tiie duplex is bound to a surface, so that upon the formation of duplex on 

the surface, the presence of antibody bound to the duplex can be detected. 

Gene expression, altemathrely, may be measured by immunotogical methods, such as immunohistochemical 

staining of ceUs or tissue sections and assay of cell culture or body fluids, to quantitate directly tije expression of gene 
25 product Antibodies useful for immunohistochemh:al staining and/or assay of sample fluids may be either monodonal or 

polydonal, and may be prepared m any mammal Convenientiy, the antibodies may be prepared against a native sequence 

CHEPO polypeptide or against a syntiietic peptide based on the ON A sequences provided herein or against exogenous 

8«|uence fused to CHEPO DNA and encoding a specific antibody epitope. 

30 5. Purification of Pfllvniiiitirfp 

Forms of CHEPO may be racovered from cufture meifium or from host cell lysates. If membrane-bound, it can 
ba rebasMl from tin membrane using a suitable detergent solution (ft^. Triton-X 100) or by enzymatic cleavage. CcBs 
employed in expression of CHEPO can ba disrupted by various physical or chemical means, such as freezs-thaw cycling, 
soiHcation, mechaincd disruption, or ceD lyshg agents. 
35 It may ba desired to purify CHEPO from recombinant cell proteins or polypeptides. The following procedures are 

exemplary otsintaUe purification procedures: by fractnination on an ionwhange cohmiv ethanol precqiitation; reverse. 

64 



wo 00/68376 



PCTAJSOO/12370 



phase HPLC; chromatography on silica or on a catioMXchanee resin such as OEAE; chromatofocusing; SOS-PAGE; 
ammonium sulfate precipitation; gel filtration using, for example. Sephadex G-7S; protein A Sepharose columns to remove 
contaminants such as IgG; and metal chelating columns to bind epitope-tagged forms of the CHEPO. Various methods of 
protein purification may be employed and such methods are known in the art and described for example in Deutscher, 
5 Methods in EnzYmoloov. 182 (19901; Scopes, Protein Purification: Principles and Practice. Springer-Verlag, New York 
(1982). The purification step(s) selected will depend, for example, on the nature of the production process used and the 
particular CHEPO produced. 

L Uses for CHEPO 

10 Nucleotide sequences (or their complement) encoding CHEPO have variouis applications in the art of molecular 

biology, inchiding uses as hybridization probes, in chromosome and gene mapping and in the generation of anti<sense RNA 
and DNA. CHEPO nucleic add wiD also be useful for the preparatimi of CHEPO polypeptides by the recombinant techniques 
described herein. 

The full-length native sequence CHEPO cDNA (SEQ ID N0:3|, or portions thereof, may be used as hybridization 
15 probes for a cONA library to isolate the full-length CHEPO cDNA or to isolate still otiier cONAs (for instance, tiiose 
encoding naturally-occurring variants of CHEPO or CHEPO from otiier species) which have a desired sequence identity to 
tiie CHEPO sequence disclosed in Figure 2 (SEQ 10 N0:3). Optionally, the length of the probes wiO be about 20 to about 
50 bases. The hybritfization probes may be derived from at least partially novel regions of ttie nucleotide sequence of SEQ 
ID N0:3 wherain those regions may be determined wMiout undue experbnentation or from genomic sequences mchidmg 
20 promoters, enhancer elements and introns of native sequence CHEPO. By way of example, a screenbig method wSI 
comprise oolating tiie codmg region of tfie CHEPO gene usmg tiie known DNA sequence to synthesize a selected probe of 
about 40 bases. Hybridization probes may be labeled by a variety of labels, inchiding ladionudeotides such as ^P or 
or enzymatic labels such as alkaline phosphatase coupled to tite probe via avidinftiiotin coupling systems. Labeled probes 
having a sequence complementary to that of tiie CHEPO gene of tiie present Invention can be used to screen libraries of 

2 5 human cONA, genomic ONA or mRNA to determine which members of such libraries the probe hybridizes to. Hybridization 

techniques are described m further detail in tite Examples below. 

Any EST sequences disdosed m tiie present application may similariy be employed as probes, usmg tin metfiods 
disdosedhmin. 

Otiier useful fragments of the CHEPO nucleic adds inchide antisense or sense oligonucleotides comprising a 

3 0 singe-stranded nudeic add sequence (eitfier RNA or ONA) cepable of binding to target CHEPO mRNA (sense) or CHEPO 

DNA (antisense) sequences. Antisense or sense ofigonudeotides, according to tiie present invention, comprise a fragment 
of tiie coifing region of CHEPO DNA. Such a fragment generally comprises at least about 14 nudeotides, preferably from 
about 14 to 30 micleotides. The ability to derive an antisense or a sense oligonudeotide, based upon a cDNA sequence 
encoding a gWen protein is described in, for example. Stein and Cohen (C«mAe& 48:2659, 1988) and van der Krol et 
35 9LmTechnitpf8sBm,mi]. 
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Binding of antisense or sense oligonucleotides to target nucleic add sequences results in the formation of 
duplexes that block transcription or translation of the target sequence by one of several means, inchjding enhanced 
degradation of the duplexes, premature termination of transcription or translation, or by other means. The antisense 
oligonucleotides thus may be used to block expression of CHEPO proteins. Antisense or sense oligonucleotides further 
5 comprise oGgonucleotides having modified sugar-p hosphodiester backbones (or other sugar Gnkages, such as those 
described in WO 91/06629) and wherein sudi sugar Gnkages are resistant to endogenous nucleases. Such nBgnfUfrteotides 
with resistant sugar finkages are stable m vm (i^., capable of resisting enzymatic degradation) but retain sequence 
spedf icity to be able to bmd to target nucleotide sequences. 

Other examples of sense or antisense ofigonucleotides inchide those oligonucleotides which are covalently linked 

10 to organic moieties, such as those descrOied in WO 90/10048, and other moieties that increases affinity of the 
oligonucleotide for a target nucleic acid sequence, such as poly-(Uysine). Further still, mtvcalating agents, such as 
ellipttcine, and alkylating agents or metal complexes may be attached to sense or antisense oligonucleotides to modify 
binding specificities of the antisense or sense oligonucleotide for the target nucleotide sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing the target nucleic add sequence by 

15 any gene transfer method, including, for example, CaP04-mediated ONA transfection, electroporation, or by using gene 
transfer vectors such as Epstein Barr virus. In a preferred procedure, an antisense or sense oligonucleotide is insmed mto 
a suitable retroviral vector. A cell containing the target nudeic add sequence is contacted with the recombinant retroviral 
vector, either in vivo or ex vivo. Suitable retroviral vectors include, but are not limited to, those derived from the murine 
retrovirus M MuLV, N2 (a retrovirus derived from M-MuLV)« or the double copy vectors designated OCTSA, OCTSB and 

20 0CT5C (see WO 90/13641). 

Sense or antisense oligonucleotides also may be introduced into a cell containing the target nudeotide S0|uence 
by foimation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable Ogand binding 
molecules indude, but are not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that bind 
to cell surface receptors. Preferably, conjugatibn of the Ggand binding molecule does not substantially interfere with the 

25 abifity of the ligand binding molecule to bind to its corresponding molecule or receptor, or block entry of the sense or 
antisense ofigomicleotide or its conjugated version into the cbD. 

iUtematively, a sense or an antiseme oT^oniKleotHie may be introduM into a cefl containing the target nucleic 
add sequence by formation of an oBgmiudeotNiefpNl complex, as desoibed In WO 80/10448. The sense or aittseose 
olqonucleotide-lipid complex is preferably dissodated within the ceO by an emiogenous Iqnse. 

30 Itepnibes may also be employed n PGR tedmiques to generate a po^ 

related CHEPO codmgjsequences. 

Nudeotide sequences encqcfing a CHEPO caiv also be used to construct 
whnh encodes that CHEPO and for the genetic analysis (Df ihdhriduals with genetic disordera. The nudeotide sequences 
provided herein may be mapped to a chromosome and specific rqions of a chromosome using known techniques, such as 
3 5 ^snfSv hybridization, Bnkage andyds agdnst known chronrasomal markers, and hybridization screening with Ihraries. 
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When the coding sequences for CHEPO encode a protein which binds to another protein (example, where the 
CHEPQ is a receptor), the CHEPO can be used in assays to identify the other proteins or molecules invoh/ed in the binding 
interaction. By such methods, inhibitors of the receptor/iigand binding interaction can be identified. Proteins involved in 
such bindmg interactions can also be used to screen for peptide or small molecule inhibitors or agonists of the binding 
5 interaction. Also, the receptor CHEPO can be used to isolate correlative ligand(s). Screening assays can be designed to 
fmd lead compounds that mimic the biological acthrity of a native CHEPO or a receptor for CHEPO. Such saeening assays 
will include assays amenable to high-throughput screening of chonicai libraries, making them particularly suitable for 
identifying small molecule drug candidates. SmaB molecules contemplated inchjde synthetic organic or inorganic 
compounds. The assays can be performed in a variety of formats, inchiding protein-protein binding assays, biochemical 

10 screening assays, immunoassays and cell based assays, which are well characterized in the art. 

Nucleic adds which encode CHEPO or its modified forms can also be used to generate either transgenic animals 
or "knock out" an'nnals which, in turn, are useful in the development and screening of therapeutically useful reagents. A 
transgenic animal (e.g., a mouse or rat) is an animal having cells that contain a transgene, which transgene was introduced 
into the annnal or an ancestor of the animal at a prenatal, e.g., an embryonic stage. A transgene is a DNA which is 

1 5 integrated into the genome of a cell from which a transgenic animal develops. In one embodiment cDNA encoding CHEPO 
can be used to clone genomic DNA encoding CHEPO in accordance with estabfished techniques and the genomic sequences 
used to generate transgenic animals that contain cells which express DNA encoding CHEPO. Methods for generating 
transgenic animals, particularly animals such as mice or rats, have become conventional in the art and are described, for 
example, in U.S. Patent Nos. 4,736,866 and 4,870,009. TypicaQy, pellicular cells would be targeted for CHEPO transgene 

20 incorporation with tissue<specific enhancers. Transgenic animals that inchide a copy of a transgene encoding CHEPO 
introduced into the genn Gne of the animal at an embryonic stage can be used to examine the effect of increased expression 
of DNA encoding CHEPO. Such animals can be used as tester animals for reagents thought to confer protection from, for 
example, pathological conditions associated with its overexpression. In accordance with this facet of the invention, an 
animal is treated with the reagent and a reduced incidence of the pathological condition, compared to untreated animals 

2 5 bearing the transgero, would indicate a potential therapeutic intervention for the pathological condition. 

Ahmatively, non-human homologues of CHEPO can be used to construct a CHEPO "knock out" aramal which 
has a defective or altered gene encoding CHEPO as a result of homologous recombination between tiie endogenous gene 
encotfing CHEPO and altered genomic DNA encoding CHEPO introduced into an embryonic stem cell of tin animal For 
axampie, cDNA encoding CHEPO can be used to done gmomic DNA encoding CHEPO in accordance wMi estaUished 

30 technnpies. A portion of ti»gemiirac DNA encoding CHEPO can be deleted or replaced^ 

oicoding a seiectaUe marker which can iu used to monitor integration. TypkaBy, several kikibases of unaltered flanking 
DNA (botii at tiie 5' ami 3* amis) are mchided in tiie vector [see e.g^ Thomas ami Capecchl M Sl:503 (1887) for a 
description of homotagous recombmation vectors]. The vector is mtroduced into an endiryonic stem cell line {b4^ by 
dectfiiporatioh) and cells in which the mtroduced DNA has homologously recombined witii tfie endogenous DNA are 

35 selected [see e.g^ U et aL, Cd, 69:915 (1992)1. The setocted cells are titen Injected mto a blastocyst of an animal (e.g^ 
a mouse or rat) to fomi aggregatiui chimeras (see ag., Bradley, in fan^^ 
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Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 1 13-1521. A chimeric embryo can then be implanted into a suitable 
pseudopregnant female foster animal and the embryo brought to term to create a "knock out" animal. Progeny harboring 
the homologously recombined DNA in their gemi ceOs can be identified by standard techniques and used to breed animals 
in which all cells of the animal contain the homologously recombined DNA. Knockout animals can be characterized for 
5 mstance, for their abifity to defend against certain pathological conditions and for their development of pathological 
conditions due to absence of the CHEPO polypeptide. 

Nucleic acid encoding the CHEPO polypeptides may also be used in gene tfierapy. In gene therapy applications, 
genes are introduced into cells in order to achieve ki mo synthesis of a therapeutically effective genetic product for 
example for replacment of a defective gene. •Gene tiwrapy" includes botii conventional gene therapy where a lasting 

1 0 effect is achieved by a single treatment and tiie adnrinistration of gene therapeutic agents, which involves tiie one time 
or repeated admmistration of a therapeutically effective DNA or mRNA. Antisense RNAs and DNAs can be used as 
tiierapeutic agents for blocking tiie expression of certain genes in mo. It has already been shown that short antisense 
oBgonudeotides can be nnported into cells where tiiey act as inhibitors, despite their low intracellular concentrations 
caused by tiieir restricted uptake by the cell membrane. (Zamecnik et Proc. Natl. Acad. Sci. USA gg, 41434146 

15 (19861). The oligonucleotides can be modified to enhance their uptake, e.g. by substituting their negatively charged 
phosphodiester groups by uncharged groups. 

There are a variety of techniques available for introducing nucleic adds into viable cells. The techniques vary 
depending upon whether tiie nudeic acid is transferred into cultured cells in vitro, or iv? wvo in tite cells of tin intended host 
Techniques suitable for the transfer of nudeic acid into mammalian cdls in vitro indude tiie use of Oposomes, 

2 0 electroporation, microinjection, cell fusion, DEAE-dextran, the caldum phosphate precipitation method, etc The currently 

preferred mo^ gene transfer tedmiques indude transf ection with viral (typically retrovirafl vectors and viral coat protein- 
liposome mediated transfection (Ozau et ai., Trends in Biotechnolnoy 205-210 11993]). In some situations it is 
desirable to provide the nudeic add source witii an agent that targets the target cells, such as an antibody specific for 
a cell surface membrane protein or the target cell, a ligand for a receptor on tiie target cell, etc. Where liposomes are 

25 employed, proteins whidi bind to a ceil surface membrane protein assodated with endocytosis may be used for targeting 
and/or to fadttate.uptake, e.g. capsid proteins or fragments tiiereof tropic for a particular cell type, antibodies for proteins 
wMdi umiergo intemaiization In cyding, proteins tiiat target intracellular localization and enham:e intraceOular half-life. 
The tedmique of receptor-imdiated endocytosis b ibscribed, for example, by Wu et aL, J. Biol. Chem. 262. 44294432 
(19871; and Wagner et aL. Proc, [tfafl, Acad, UgA gL.341&3414 (1990). For review of gene marking and gene 

30 therapy pmtocds see Arelefson gf rf,. Sdenca 25B. 80fl.8l3 nflfl?! 

The CHEPO polypeptides described herein may also be employed as molectdar weight marken for protein 
dectrophoresis purposes. 

The noddc add motecdes encoding the CHEPO polypeptides or fragments tfiareof described herein are useful 
for chromosome Mentification. bi this regard, there exists an ongoing need to UentHy new diromosome marken, since 

3 5 relativdy few diramosome marking reagents, based upon actual sequence data are presently available. Eadi CHB>0 

nudeic acid moiecde of tiie present invention can be used as a chromosoma marker. 
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The CHEPO polypeptides and nucleic acid molecules of the present invention may also be used for tissue typing, 
wherein the CHEPO polypeptides of the present invention may be differentially expressed in one tissue as compared to 
another. CHEPO nucleic acid molecules will find use for generating probes for PCR, Northern analysis. Southern analysis 
and Western analysis. 

5 The CHEPO polypeptides described herein may abo be employed as therapeutic agents. The CHEPO polypeptides 

of the present invention can be formulated according to known methods to prepare pharmaceuticaSy useful compositions, 
whereby the CHEPO product hereof is combined in admixture with a pharmacmitically acceptable carrier vehicle. 
Therapeutic formulations are prepared for storage by mnnng the active mgredient having the desired degree of purity with 
optional physiolo^caHy acceptable carriers, excipients or stabilizers (Reminoton's Pharmaceutical Sciences 16th edition, 

10 Osol, A. Ed. (1980U, in the form of lyophilized formulations or aqueous sohitions. Acceptable carriers, excipients or 
stabilizers are nontoxic to recipients at the dosages and concentrations employed, and inchide buffers such es phosphate, 
citrate and other organic adds; antioxidants incluifing ascoriiic acid; low molecular weight (less than ^out 10 residues) 
polypeptides; proteins, such as serum albumin, gelatin or immunoglobulins; hydrophilic polymers such as 
polyvinyipyrrefidone, amino acids such as glycine, ghitamine, asparagine, arginine or lysine; monosaccharides, disaccharides 

1 5 and otiier cartiohydrates including glucose, mannose, or dextrins; chelating agenU such as EDTA; sugar alcohols such as 
mannitol or sortritoi; salt-forming counterions such as sodium; and/or noniornc surfactants such as TWEEN™, PLURONICS™ 
or PEG. 

The formulations to be used htm mo administration must be sterile. This is readily accomplished by filtration 
through sterile filtration membranes, prior to or following lyophiGzation and reconstitution. 

2 0 Therapeutic compositions herein generaSy are placed into a container having a sterile access port for examfrie, 

an intravenous solution bag or vial having a stopper pierce^le by a hypodermic injection needle. 

The route of administration is in accord with known methods, e.g. injection or infusion by intravenous, 
intraperitoneal, intracerebral, intramuscular, inuaocular, intraarterial or intralesionai routes, topical administration, or by 
sustained release systems. 

25 Dosages and desired drug concentretions of pharmaceutical compositions of the present invention may vary 

depending on tiie particular use envisioned. The determination of the appropriate dosage or route of administration is well 
w'rthbi the skill of an onSnary phystdan. Animal expernnents provide reliable guidance for the determination of effective 
doses for human therapy. Interspedes scaling of effective doses can be perfonned f olowing the prindples hid down by 
Mordenti, J. and Chappall W. "The use of mterspades scafing in toxicokinetics" jn Toxicokinetics and New Drug 

30 Development Yacdri era/., EdirPergamonPrBss, New York 1989, pp. 42-9EL 

When A vifo administration of a CHEPO polypeptide or agonist or antagonist tinreof b emphiyed, normd dosage 
amounts may vary from about 10 ngfkg to up to 100 mg/kg of mammal body weight or more per day« prefmbty about 1 
gfkgfday to 10 rnglkgfday, depemEng upon the route of idministration. Guidance as to particular dosages and methods 
of defivery is provided in the literature; see, for example, U.S. Pat Nos. 4,657.760; 5^6,344; or 5,225,212. It is 

3 5 anticipated tiiat different formulations wiO be effective for different treatment compounds and dif fermt disorders, that 
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administration targeting one organ or tissue, for example, may necessitate delivery in a manner different from that to 
another organ or tissue. 

Where sustained-release administration of a CHEPO polypeptide is desired in a formulation with release 
characteristics suitable for the treatment of any disease or disorder requiring administration of the CHEPO polypeptide, 
5 microencapsulation of the CHEPO polypeptide is comemplated. Microencapsulation of recombinant proteins for sustained 
release has been successfully performed with humai growth hormone (rhGH), interferon- (rhlFN- ), interleukin-2, and MN 
rgp120. Johnson et a!., Nat. Med„ 2: 795799 (1996); Yasuda. Biomed Ther.. 22: 122M223 (19931; Mora et ai., 
BiofTechnoloov. 8: 755758 (1990); Cleland, "Design and Production of Single Immunization Vaccines Using Polylactide 
Polyglycolide Microsphere Systems." in Vaccine Desion: The Subunit and Adjuvant Aooroach. Powell and Newmanr eds, 
10 (Plenum Press: New York. 1995). pp. 439462; WO 97/03692. WO 96/40072. WO 96/07399; ami U.S Pat No. 5,654.010. 

The sustained-release formulations of these proteins were developed using poly-lactic-coglycolic acid (PL6A) 
polymer due to its btocompatibility and wide range of biodegradable properties. The degradation products of PLGA, lactic 
and glycolic acids, can be cleared quickly within the human body. Moreover, the degradabOity of this polymer can be 
actusted from months to years depending on its motecuhr weight and composition. Lewis. "Controlled release of bioactive 
15 agents from lactide/gtycoOde polymer." im M. Chasin and R. Langer (Eds.). Biodegradable Polvmers as Oruo Delhferv 
Systems (Maicei Oekker. New York. 1990). pp. 141. 

This invention encompasses methods of screening compounds to identify those that mimic the CHEPO 
polypeptide (agonists) or prevent the effect of the CHEPO polypeptide (antagonists). Screening assays for antagonist dn^ 
candidates are designed to identify compounds that bind or complex with the CHEPO polypeptides encoded by the genes 
2 0 identified herein, or otherwise interfere with the biteraction of the encoded polypeptides with other cellular proteins. Such 
screening assays will include assays amenable to high-throughput screening of chemical libraries, making them particularly 
suitable for identifying small molecule drug candidates. 

The assays can be performed in a variety of formats, including protein-protein bind'mg assays, biochemical 
screening assays, immunoassays, and cell-based assays, which are well characterized in the art AD assays for 

2 5 antagonists are common in that they call for contacting the drug candidate with a CHEPO polypeptide encoded by a nucleic 

acid identified herein under conditions and for a tone sufficient to allow these two components to interact 

In bmding assays, the interaction is binding and the complex formed can be isolated or detected in the reaction 
mixture. In a particular endiodimeiil the CHEPO polypeptide encoded by the gene identified herein or tfie drug candidate 
is immobiBzed on a sofid phases eg., on a microtiter plate, by covalent or non-covalent attachments. Non^ovatent 

3 0 attachment generally is accompfished by coating tin soGd surface widi a sohition of die CHEPO polypeptide and drying. 

Alternatively, an immobilized antibody, & a monodonfll antibody, specify im tiie CHEPO polypeptide to be immobilized 
can be used to anchor it to a soGd surface. The assay is perf onned by adding die notHmmobiGzed conqranem. which may 
be labeled by a detecteUe label, to d» fanmobilized component e.g^ die coated surface contamtng die anchored 
compmnt. When die reaction is complete, die non-reacted components are removed, a^., by washmg, and complexes 
35 anchored on tin solhl surface are detected. Whentiworiginailynon-bnmoblizedcomp^ 

detection of label immobilized on die surfece mdicates diat comphxing occurred. Where die originally non-immobilized 

70 
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component does not carry a label compiexing can be detected for example, by using a labeled antibody spedftcaOy binding 
the immobilized complex. 

If the candidate compound interacts with but does not bind to a particular CKEPO polypeptide encoded by a gene 
identified herein, its interaction with that polypeptide can be assayed by methods well known for detecting protein-protein 
5 interactions. Such assays include traditional approaches, such as, e.g., cross-linking, co-immunoprecipitation, and co* 
purification through gracfients or dvomatographic cokonns. In addition, protein-protein interactions can be monitored by 
using a yeast-based genetic system described by Holds and co-workers (Helds and Song, Nature (London). 340: 245-246 
(1989); Chien 8t at,. Prta, Nal Acad. Sd. USA, fig: 9578-9582 (1991)) as disclosed by Chevray and Nathans, ProcNatl. 
Apad.Scj. USA, fig: 5789-5793 (1991). Many transcrqitionai activators, such as yeast GAL4, consist of two physically 

1 0 discrete modular domains, one acting as the DNA-bindmg domain, the other one functioning as the transcrqition-activation 
domain. The yeast expression system described in die foregoing publications (generally referred to as the "two-hybrid 
system^ takes advantage of this property, and employs two hybrid proteins, one in which tiie target protein is fused to 
the DNA-binding domain of 6AL4, and another, in which candidate activating proteins are fused to tfie activation domain. 
The expression of a HkLVbcl reporter gene under control of a GAL4-acttvated promoter depends on reconstitution of 

1 5 G AL4 activity via protein-protein interaction. Colonies contain'mg interactmg polypeptides are detected with a chromogenic 
substrate for galactosidase. A complete kit (MATCHMAKER™) for identifying proteinfrotein interactions between two 
specific proteins using the two-hybrid technique is commerdaDy ava^able from Ctontech. This system can also be extended 
to map protein domains involved in spedfic protein interactions as well as to pinpoint amino add residues that are crudal 
for these interactions. 

2 0 Compounds that interfere with the interaction of a gene encoding a CHEPO polypeptide identified herein end 

other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared containing the 
product of the gene and tiie intra- or extracellular component under concfitions and for a time allowing for the interaction 
and binding of the two products. To test tiie ability of a candidate compound to inhibit binding, tiie reaction is run in tiie 
absoice and in tiie presence of tiia test compound. bi addition, a placebb may be added to a titird reaction mixture, to s&ve 

25 as positive control. The binding (complex formation) between tha test compound and tiie intra- or extraceOulaf component 
present in the mixture is monitored as descrSied hereinabove. The f onnation of a complex in the control reaction(s) but 
not in ti» reaction mixture containing tiie test compound indicates tfiat tiie test compound interferes with tiie interaction 
oif the test conqiound and its reaction partner. 

To assay for antagonists, the CHEPO polypeptide may be added to a cefl along witii tiie compound to be scremd 

30 for a particular activity and tiie ability of tfie compound to bihibit tiie activity of interest in tin presence of the CHEPO 
pdypqitide intcates that die compound is an antagonist to the CHEPO polypeptide. Alternatively, antagonists may be 
detected by condiimng du CHEPO polypei^ and a potential antagonist vvitii nnmbrane-bound CHEPO polypeptide 
receptors or tecombinant receptors under apprtqmatecoh^^ Tlie CHEPO polypeptide 

can be labeled, such » by radioactivity, such diat die nundiw 

35 be used to determine die effectiveness of die potential antagonist The gene encoding die receptor can be idmtifnd by 
inmierousmedwdskmnimtoduiseofskiintiiieartforexan^ Coinang/a£. Current 
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Protocols in Immun.. 1(2): Chapter 5 (1991). Preferably, expression cloning Is employed wherein polyadenyiated RNA is 
prepared from a cell responsive to the CHEPO polypeptide and a cONA library created fronfi this RNA is divided into pools 
and used to transfect COS cells or other ceils that are not responsive to the CHEPO polypeptide. Transfected ceils that 
are grown on glass sQdes are exposed to labeled CHEPO polypeptide. The CHEPO polypeptide can be labeled by a variety 
5 of means including iodination or inclusion of a recognition site for a site-specific prote'm kinase. Following fixation and 
incubation the slides are subjected to autoradiographic analysis. Positive pools are identified and sub-pods are prepared 
and re-transfected using an interactive sub-pooling and re-screening process, eventually yielding a singte clone that encodes 
the putative receptor. 

As an aitemative approach for receptor ictentification, labeled CHEPO polypeptide can be photoaf finity-finked with 

10 cefl membrane or extract preparations that express the receptor molecule. Cross-linked meterial is resolved by PAGE and 
exposed to X-ray film. The labeled complex containing the receptor can be excised resohred Into peptide fragments, and 
siAjected to protein micro-sequencing. The amino acid sequence obtained from miao- sequencing would be used to des^n 
a set of degenerate oligonucleotide probes to screen a cDNA library to identify the gene encoding the putathre receptor. 
In another assay for antagonists, mammalian cells or a membrane preparation expressing the receptor wouM be 

15 incubated with labeled CHEPO polypeptide in the presence of the candidate compound. The ability of the compound to 
enhance or block this interaction could then be measured. 

More specific examples of potential antagonists include an oligonucleotide that binds to the fusions of 
immunoglobulin with CHEPO polypeptide, and, in particular, antibodies including, without limitation, poly- and monoclonal 
antibodies and antibody fragments, single-chain antibodies, antl-idiotypic antibodies, and chimeric or humanized versions 

20 of sudi antibodies or fragments, as well as human antibodies and antibody fragments. Alternatively, a potential antagonist 
may be a closely related protein, for example, a mutated form of the CHEPO polypeptide that recognizes the receptor but 
imparts no effect thereby competitively inhibiting the action of tiie CHEPO polypeptide. 

Another potential CHEPO polypeptide antagonist is an antisense RNA or ON A construct prepared using antisense 
technology, where, e.g., an antisense RNA or ONA molecule acts to block dlrectiy the translation of mRNA fay hybridizing 

25 to targeted mRNA and preventing protein translation. Antisense technotogy can be used to control gene expression tiirough 
tripb^iefix formation or antisense DNA or RNA, botfi of which methods are based on fainiSng of a polynucleotide to ONA 
or RNA. For examph the 5* coiBng portion of the polynucleotide sequence, which encodes the mature CHEPO polypeptides 
herein, is used to design an antisense RNA oligonucleotide of from about 10 to 40 base pairs in kngdi. A DNA 
oGgonudeotide is designed to be comptonentary to a reghin of the gene hnnrived in transcription (triple heGx • see Lee Bt 

30 flC |facLAcHfeBqfff> & 3073 (1979k Cooney efat Sdema. 241: 456 (19881; Dervan er at Saanca. Ql: 1360 (1991)), 
diereby preventing transcription and the production of the CHEPO polypeptide. The antisense RNA otqonudeotide 
hybridizes to tiie mRNA A Wto and blocks translation of the mRNA molecule mto tiie CHEPO polypeptide (antisense - 
Okam>. Mairochm, SB: 5B0 (1991): OBaodaoxvmicieotides as Antisense hihibitois of Gene Exniessinn (CRC Press: Boca 
Ratmi FU 198Ql The oBgomtdeptides described above cen ebo be delhrered to cells such tiiat tiie antisense RNA or DNA 

35 may be expressed b fmfo to nihibit production of tiie CHEPO polypeptide. When antisense DNA is used. 
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oligodeoxyribonucleotides derived from the translation-initiatioii site, e.g., between about -10 and 10 positions of the 
target gene nucleotide sequence, are preferred. 

Potential antagonists include small molecules that bind to the active site, the receptor binding site, or growth 
factor or other relevant binding site of the CHEPO polypeptide, thereby blocking the normal biological activity of the CHEPO 
5 polypeptide* Examples of small molecules include, but are not limited to, small peptides or peptide-Gke molecules, 
preferably soluble peptides, and synthetic non-peptidyt organic or inorganic compounds. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. Ribozymes act by 
sequence-specific hybridization to the complementary target RNA, followed by endonucleofytic cleavage. Specific ribozyme 
deavage sites within a potential RNA target can be identified by known techniques. For further details see, B.g^ Rossi, 
10 Current Bioloflv. 4: 469471 (1994), and PCT publication No. WO 97/33551 (published September 18, 1997). 

Nucleic acid molecules in triple-helix formation used to inhibit transcription should be s'mgle stranded and 
composed of deoxynudeotides. The base composition of these oligonucleotides is designed such that it promotes triple- 
helix formation via Hoogsteen base-pairing rules, which generally require sizeabto stretches of purines or pyrimidines on 
one strand of a duplex. For further details see, e.g., PCT publication No. WO 97/33551, Jir/w. 
1 5 These smalt molecules can be identified by any one or more of the screening assays discussed hereinabove and/or 

by any other screening techniques well known for those skflled in the art. 

F. AntiCHEPO Antibodies 
The present invention further provides anti-CHEPO antibodies. Exemplary antibodies include pdlychmat 
2 0 monoclonal, humanized, bispedfic, and heterocoryugate antibodies. 

1. Polvdonal Antibodies 
The anti-CHEPO antibodies may comprise polydonal antibodies. Methods of preparing polyclonal antibodies are 
known to the skilled artisan. Polydonal antibodies can be raised in a manunal, for example, by one or more injections of 

25 an immunizing agent and, if desired, an aiQuvant. Typically, the immunizing agent and/or a^uvant wiQ be mjected in the 
mammal by multlfde subcutaneous or intraperitoneal iiqections. The immunizing agent may indude the CHEPO polypeptide 
or a fusion protein thereof. It may be useful to coiijugate the nnmunizing agoit to a protein known to be inununogenic in 
the manshal beino immurized. Examples of such immunogenic proteins indude but are not fimtted to keyhole limpet 
hemocyanfau serum aBumun, bovnie thyroglobuGn, and soybean trypsin mhibitor. Examples of adjuvants which may be 

30 employed indude FremuTs complete ai^uvant and MPL-TOM adjuvant (monophosphoryl Lipid A, synthetic trehalose 
dicorynomycdate). Tiie immumzathm protocol may be sdected by one skOled in the art without undue experknentation. 

. 2. MonodonalAntibniBaa 
Tin anti^CHEPO antibodies may/ahematively, ba nranodonal antibodies. Monodonal antibodies may be prepared 
35 using bybridoma methods, sudi as those deserted by Kohler and Milstein, Nature. ^^195 (1975). In a hybridoma 
method, a mousa, hamster, or otiier appropriate host annnal, is typkaDy inmuinized wtdi an immunizing agent to elicit 
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iymphocytes that produce or are capable of producing antibodies that will specifically bind to the immunizing agent 
Alternatively, the lymphocytes may be immunized in vitro. 

The immunizing agent will typically include the CHEPO polypeptide or a fusion protein thereof. Generally, either 
peripheral blood lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph node cells are 
5 used if non-human mammalian sources are desired. The lymphocytes are then fused with an immortalized cell line using 
a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles 
and Practice, Academic Press, (1986) pp. 59-103]. immortalized cell lines are usually transformed mammalian cells, 
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are employed. The 
hybridoma cells may be cultured in a suitable culture medium that preferably contains one or more substances that inhibit 

10 the growth or survhral of the unf usad, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine 
guanine phosphorifaosyl transferase (H6PRT or HPRT), the culture medium for the hybridomas typically will include 
hypoxantlnne, aminopterin, and thymidine THAT medium"), which substances prevent the growth of HGPRT-def idem cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of antibody 
by the selected antibody-praducmg cells, and are sensitive to a medium such as HAT medium. More pref enred immortafized 

15 cell fines are murine myeloma lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, 
San Diego, California and the American Type Culture Collection, Manassas, Virginia. Human myeloma and mouse-human 
heteromyeloma cell lines also have been described for the production of human monoclone vitibodies [Kozbor, J. Immunol. 
112:3001 (1984); Brodeur et at Monoclonal Antibody Prmiuction Tc chnioues and Annlicatinns. Marcel Dekker, Im:., New 
Yoric (1987) pp. 51-631. 

20 The ctdtuire medium in wMch the hybridoma eels are cultured can then be assayed for the presme of monodoral 

antibodies directed against CHEPO. Preferably, the binding specificity of monoclonal antibodies produced by the hybridon^ 
ceOs is determined by immunoprecipitation or by an in vitro Unding assay, such as radioimmunoassay (RIA) or enzyme- 
linked imnuinoabsori)ent assay (ELISA). Such techniques and assays are known in the art. The binding affinity of the 
monoclonal antOmdy can« for example, be determined by the Scatchard analysis of Munson and Poflard, Anal. Biochem.. . 

25 192:220(1980). 

After the desired hybridoma c^s are identified, the clones may be subcloned by loniting dflution procedures and 
grown by standard methods (Goding, suoral . Suitable culture media for this purpose inchide, for example, Dulbecco's 
Modified Eagle's Medium and RPMI-1640 meifium. Alternatively, the hybridoma cells may be grown mmoo& ascites in 
a mammal 

30 The monoclonal antibodies secreted by the subclones may be isolated or puri^d from the culture medium or 

ascites flidd by conventional nrnnunoglobufin purification procedures such as, for example, protein A-Sepharose, 
hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

The monoclonal amOiodtes may Also be made by recombinant DNA methods, such as those described in U.S. 
Patent No. 4,816,567. DNA encodmg the monodonai antibo(fies of the invention can be readily isolated and sequenced 

35 uang conventional procedures (e.g., by using oigonudeotnte probes that are capable of binding specifically to genes 
ehcoiSng the heavy and Enchains of murine antibodiesl. The hybridoma cebd the invention serve as a preferred source 
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of such DNA. Once isolated, the DNA may be placed into expression vectors, which are then transfected into host cells 
such as simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise produce 
immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also may 
be modified, for example, by substituting the coding sequence for human heavy and light chain constant domains in place 
5 of the homologous murine sequences IU.S. Patent No. 4,81 6,567; Morrison et aU suoral or by covalently joining to the 
immunoglobufin codmg sequence all or part of the cod'mg sequence for a non immunoglobulin polypeptide. Such a non- 
immunoglobufin polypeptide can be substituted for the constant domains of an antibody of the invention, or can be 
substituted for the variable domains of one antigen-comb'uilng site of an antibody of the invention to create a chimeric 
bivalent antibody. 

10 The antibodies may be nwnovalent antibodies. Methods for preparing monovatent antibodies are well Icnown in 

the art For example, one method invohes recombinant expression of immunoglobulin light cham and modified heavy chain. 
The heavy chain is truncated generally at any point in the Fc region so as to prevent heavy chain crosslinking. 
Altemathrely. the relevant cysteine residues are substituted with another amino acid residue or are deleted so as to prevent 
crossiinldng. 

15 In vitro methods are also suitable for preparing monovalent antiboifies. Digestion of antibodies to produce 

fragments thereof, particularly. Fab fragments, can be accomplished using routine techniques Icnown in the art 

3. Human and Humanized Antibodies 
The anti-CHEPO antibodies of the invention may further comprise humanized antibodies or human antSiodies. 

2 0 Hionanized forms of non-human (e.g., murine) antibodes are chimeric immunoglobulins, immunoglobufin cha'uis or fragments 
thereof (such as Fv, Fab, Fab*, Rab^ or otiier antigoi-binding subsequences of antibodies) which contain minimal sequence 
derived from non-human immunoglobulin. Humanized antibodies include human immunoglobulins (recipient antibody) in 
which residues from a complementary determining region (CDR) of the recipient are replaced by residues from a CDR of 
a non-human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. 

25 In some instances, Fv framework residues of tiie human unmunoglobufin are replaced by corresponding non-human 
residues. Humanized antibodies may also comprise resUues whicb are found neither in the recipient antibody nor fai the 
imported COR or framework sequences, bi general, the humanized antibody will comprise substantiaDy al of at least one, 
and typically two, variable domainal in which d or sid^^ 

imniunoglobuiin and all or substantiaDy all of die FR regim are tiiosa of a humai immunogUbufin consensus sequeiu». 
30 The humanized antibody optbnaHy also vriBcomprin at least a portion of a^ 

that of a human immunoglobidin (Jones et aL, Nature. 321:522-525 (1986); Riechmann et al^ Mature. 332323-329 

H9881: and Prtsta. Ciot. Op, Struct BioL 2!593-59B 11992)1 

- Mediods for humanizing non-human antibodies are weQ known in tiie art. Generally, a humanized antibody has 

one or more enuno add residues introduced into it from a source which is non-human. These non-human ammo add 
35 residues are often referred to as "import" resklues, which ere typically taken from an "import" variable domain. 

Humanization can be essentiaOy perf onned f olbwing tiw medud of Winter and co-workers (Jones et aL, Mature. 321^22- 
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525 (1986); Riechmann et al., Nature. 332:323-327 (19881; Verhoeyen et al. Science . 239:1534.1536 (1988)L by 
substituting rodent CORs or CDR sequences for the corresponding sequences of a human antibody. Accordingly, such 
"humanized" antibodies are chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact 
human variable domain has been substituted by the correspond'mg sequence from a non-human species. In practice, 
5 humanized antibodies are typically human antibodies in which some COR residues and possibly some FR residues are 
substituted by residues from analogous sites in rodoit antibodies. 

Human antSiodies can also be produced using various techniques known in the art including phage display 
libraries [Hoogenboom and Winter, J. Mol. BioL 2ZZ:381 (1991); Marks et aL, J. Mol. BioL. 2^:581 (1991)]. The 
techniques of Cole et al. and Boomer et aL are also available for the preparation of human monoclonal antSiodies (Cole et 

10 al., Monoclonal Antibodies and Cancer Thef anv. Alan R. Liss, p. 77 (1985) and Boerner et aL, J. hnmunoL 147(1| :86-95 
(1991)1. Similarly, human antibodies can be made by introducing of human immunoglobulin loci into transgenic animals, 
e.g., mice in which the endogenous immunoglobulin genes have been partially or completely inacthrated. Upon challenge, 
human antibody production is observed, which closely resembles that seen in humans in all respects, including gene 
rearrangement assembly, and antibody repertoire. This apimiach is described, for example, in U.S. Patent Nos. 5,545,807; 

15 5,545,806; 5,569,825; 5.625,126; 5,633,425; 5,661,016, and in the following scientific pubUcations: Marks etal., 
BiolTechnoloflv 10. 779-783 (1992); Lonberg Natwe 368 856-859 (1994); Morrison, Mature 368. 812-13 (1994); 
Fishwildgf a/.. Nature Biotechnology 14. 845-51 (1996); Neuberoer. Nature Biotechnolonv 14. 826 (1996); Lonberg and 
Huszar, intern. Rev. tmmunoL 13 65*93 (1 995). 

20 4. Bispecific Antibodies 

Bispecif ic antibodies are monoclonal, preferably human or humanized, antibodies that have binding specificities 
for at least two different antigens. In the present case, one of the binding specificities is for the CHEPO, the other one 
is for any other antigen, and preferably for a cell-surface protein or receptor or receptor subunit. 

Methods for making bispecific antibodies are known in the art Traditionally, the recombinant production of 

2 5 bispecific antibodies is based on the co^xpression of two immunoglobulin heavy-chain/light-chain pairs, where the two 

heavy chains have different specificities fMOstein end Cuello, Nature. 3^:537-539 (1983)]. Because of the random 
assortment of mmmoglobulin heavy and Gght chains, these hybridomas (quadromas) produce a potential mixture of ten 
different antibody molecules, of which only one has the correct bispedftc structure. The purification of the correct 
molecule is usually accompfished by affinity chromatography steps. Similar procedures are disclosed WO 93108829, 

3 0 published 13 May 1993, and in Traunecker et aL. EMBDJ^ 10:3655-3659 (1991). 

Antibody variabto domams with the desired binding specificities (antibody-antigen combin'mg sites) can ha fused 
to tmmunoglobuBn constant domain sequences. The fusion preferably is vrith ui immunoglobufin heavy-chain constant 
domainr comprising at hast part of the Mnge, CHZ, and CH3 re^ns. it is preferred to have the first heavy-chain constant 
region (CHI) containmg the site necessary for Gght-chdn UniEng present in at least one of the fusions. DMAs encoding 
35 the immunoglobulin heavy-chain fusions and, if dashed, the anmunoglobuGn light chain, are hiserted into separate 
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expression vectors, and are co-transfected into a suitable host organism. For further details of generating bispecific 
antibodies see, for example, Suresh et aL, Methods in EnzYmoloffY - 121:210 (1 986). 

According to another approach described in WO 96/2701 1, the interface between a pair of antibody molecules 
can be engineered to maximize the percentage of heterodimers which are recovered from recombinant cell culture. Hie 
5 preferred interface comprises at least a part of the CH3 region of an antibody constant domain. In this method, one or 
more smaO amino acid side chains from the interface of the first antibody molecule are replaced with larger side chains 
(e.g. tyrosine or tryptophan). Compensatory cavities of identical or similar size to the large side chainis) are created on 
the interface of the second antibody molecule by replacing large amino acid side chains with smaller ones (e.g. alanine or 
threonme). TWs provides a mechanism for mcreasing the yield of the heterodimer over other unwanted end^roducts such 
10 ashomodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. F|ab bispecific 
antibodies). Techniques for generating bispedTic antibodies from antibody fragments have been described in the fiterature. 
for example, bispecific antSiodies can be prepared can be prepared using chemical finkage. Brennan et a/.. Science 229:81 
(1985) describe a procedure wherein intact antibodies are proteolyttcalty cleaved to generate F(ab jj fragments. These 

15 fragments are reduced in the presence of tiie dithiol complexing agent sodium arsenite to stabilize vicinal dithiols and 
prevent intermolecular disulfide formation. The Fab fragments generated are then converted to thionitrobenzoate (TNB) 
derivatives. One of the Fab TNB derhratives is then reconverted to the Fab -thiol by reduction with mercaptoethylamine 
and is mixed with an equimolar amount of the other Fab -TMB derivathre to form the bispecific antibody. The bispecific 
antibodies produced can be used as agents for the selective immobflization of enzymes. 

20 Fab fragments may be directiy recovered from £ co/i and chemically coupled to form bispecific antaradies. 

Shalaby etal., J. ^xp. Med. 17S:217-225 (1992) describe tiie production of a fully humanized bispecific antibody fiA Is 
molecule. Each Fab fragment was separately secreted from £ w/r and subjected to directed chemical coupling in \ntro 
to form the bispecific antibody. The bispecific antibody thus formed was able to bind to cells overexpressing the ErbB2 
receptor and normal human T cells, as well as trigger the lytic activity of human cytotoxic lymphocytes against human 

25 breast tumor targets. 

Venous technique for making and isolating bispecific antAody fragments directly from recombinant ceil culture 
heve also been described. For example, bispecific antibodies have been produced using leucine zippers. Kostetoy aftf/., 
J. Immunol 148(5):15«7-1553 (1992). The leucine zipper peptides from tiie Fos and Jun proteins were Gnked to tiie Fab 
portions of two Afferent antibodies by gene fusion. The antibody homodnners were reduced at tiie hinge region to form 

3 0 monomers and tiien mxidized to foim tiie antibody heteroifimers. Tliis metiiod cen also be utilized for the production of 
entibodyhomodoim. Tto diabody technology described by Hollinger etaL Proc Mati. Acad. Sci. USA flQ!B44A4MAft 
(1993) has provided an ahemativa mechanism for making bispecific antibody fragments. The fragments cmnprise a 
heavy-chain variable domain (Vh) connected to a Eght cham variable domain (Vi) by a linker which is too short to allow 
pairing between tiie two domains on the same chata. Accordingly, the V„ ami \ domains of one fragment are forced to 

3 5 pair witfi the conqilemaitary and Vg domans of »iotimr fragment, thereby forming two entigen^iimiing sites. Amitiier 
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strategy for making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been reported. See, 
Gruber et 1 tmmunol. 152:5368 (1994). 

Antibodies with mora than two valencies are contemplated. For example, trispecific antibodies can be prepared. Tutt et 

5 Exemplary bispecific antibodies may b'md to two different epitopes on a given CHEPO polypeptide herein. 

Alternatively, an anti-CHEPO polypeptide arm may be combined with an arm which binds to a triggering molecule on a 
leukocyte such as a T-ceB receptor molecute |B.g. CD2. CD3, CD28. or B7). or Fc receptors for IgG (Fc R), such as Fc Rl 
(CD64L Fc Rll (C032) and Fc Rill (CD1 6) so as to focus cellular defense mechanisms to the cell expressing the particular 
CHEPO poiypeptida Bispecific antibodies may also be used to tocafize cytotoxic agents to cells wluch express a particular 
10 CHEPO polypeptide. These antibodies possess a CHEPO-binding arm and an arm which binds a cytotoxic agent or a 
radionucOde chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody of interest binds the CHEPO 
polypeptide and further binds tissue factor (TR. 

5. Heteroconiuoate Antibodies 

1 5 Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate antibodies are 

composed of two covalently joined antibodies. Such antibodies have, for example, been proposed to target immune system 
cells to unwanted cells (U.S. Patent No. 4,676,9801 ami for treatment of HIV infection [WO 91/00360; WO 92/200373; 
EP 03089]. It is contemplated that the antibodies may be prepared m vitro using known methods in synthetic protein 
chemistry, including tiiose involving crossOnking agents. For example, immunotoxins may be constructed using a iBsulfide 

2 0 exchange reaction or by f omiing a tiiioetiier bond. Examples of suitable reagents for this purpose include IminothiolatB and 
methyl4-mercaptobutyrimidate and those disclosed, for exan^le, in U.S. Patent No. 4,676,g8a 

6, Effector Function Enoineering 

It may be desirable to modify tiie antibody of the invention with respect to effector function, so as to enhance, 
2 5 &g., the effecfnreness of the antibody in treating cancer. For example, cysteine residuels) may be introduced into the Fc 
region, tiiereby allowing inti^in disulfide bond formation in tiiis region. The homodimeric antibody tiius generated may 
have improved internalization capabrity and/or mcreased comptement-mediated ceO talng and antibody-dependent cellular 
cytotoxicity (ADCC). See Caron eteL. lExoMed. . J26: 1 191.1195 (1992) and Shopes, J.fanmunoL 148: 2918-2922 
(1992). Homodoneric antibmlies with enhanced anti-tumor activity may also be prepared using heterobifunctional cross- 
30 fakers as described in WoW a^. Camrer Research. S3; 25Bfl.3SfiS n ggay Alternatively, an ant&ody can be engineered 
tiiat has dual Fc regions and may thereby have enhanced complement lysis ami AOCC capabilities. See Stevenson et ai^ 
MCan^MfiBlfllL 3: 219-230 (1989). 
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7. Immunoconiuoates 
The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic agent such 
as a chemotherapeutic agent, toxin {e.g., an enzymatically active toxin of bacterial fungal, plant or animal origin, or 
fragments thereof), or a radioactive isotope {i.e., a radioconjugate). 
5 Chemotherapeutic agents useful in the generation of such immunoconjugates have been described above. 

Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, nonbinding active fragments 
of diphtheria toxin, exotoxin A chain (from Pseudomonas asrugmsal ricin A chain, abrin A chain, modeccin A chain, 
Bl^drSBrm^AkuiAesfmlS^Qtmni, dianthin msis\%,PhytQlacaanmicana proteins (PAPI, PAPII, and PAPShmomordica 
charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, mitogetlin, restrictocin, phenomycin, enomycin, 

1 0 and the tricothecenes. A variety of radionuclides are available for the production of radiocoi^ated antbodies. Examples 
include "^Bi, "'I, "V"Y. and '"Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bif unctional protein-coupling agents 
such as N-succinimidyl'3^2-pyridyldithiol) propionate (SPDP), iminothtolane (ID* bifunctional derivatives of imidoesters 
(such as dimethyl adipinndate HCU active esters (such as disuccinimidyl suberatel, aldehydes (such as glutaraldehyde), 

15 bis-azido compounds (such as bis (p-azidobenzoyi) hexanediamine), bis-diazonium derivatives (such as bis-(p- 
diazoniumbenzoyD-ethylenediam'me), dusocyanates (such as tolyene 2,6-dnsocyanateL and bis-active fluorine compounds 
(such as 1,S-dif luoro-2,4-dinitrobaizene). For ex»nple, a ricin immunotoxin can be prepared as described in Vttetta at al.. 
Science. 238: 1098 (1987). Cartjon U labeled l-isothiocyanatobenzyt-S-methyldiethylene triaminepentaacetic add (MX- 
OTPA) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See Wp94/1 1026. 

20 In another emboiSment the antibody may be conjugated to a "receptor" (such streptavi Ai) for utization in tumor 

pretargeting wherein the antibody-receptor conjugate is administered to the patient followed by removal of unbound 
conjugate from the circulation using a clearing agoit and then administration of a "ligand* \».g^ avidin) that is conjugated 
to a cytotoxic agent \fi.g., a radionucleotide). 

25 8. fanmunofiposomes 

The antibodies disdosed herein may also be fomndated as immunoOposomes. Liposomes containing the sitibody 
are prepared by methods known in the art sudi as descrBied in Epstein at al^ Proc. Natl. Acad. ScL USA. 82: 3688 (1985); 
Hwang std, Proc Watt Acad. S A USA. 77: 40^ (1980); and U.S. Pat Nos. 4,485,045 and 4,544,545. liposomes with 
enhanced drcuiatlon time are disdosed in U.S. Patent No. 5,01 3,556. 

3 0 Partkidarly useful Gposomes can be generated fay the rev^seiihase evaporationmethod with a lipid compodtion 

compridng phosphatfalylchoiine, diolesterol and PEG-derhratized phosphatidyletbanolamine (PEG-PE). Liposomes are 
extruded through filters of defined pore dze to yidd Eposomes with the desved ifianeter. Fab' f rapents of the antibody 
of the present imrention can be coiqugated to the Gposomes as described in Martiii atal^ J. BioL Chem.. 257: 28&288 
(1982) via a itsuHidraterdiaige reaction. A chemotherapeutic agent (sudh as DoxoniUdn) is optionally contained witiuh 

35 theliposome. See Babizon etaL J, National Cancer Inst.. BH19^ lAM ngflfli 
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9. Pharmaceutical Compositions of Antibodies 
5 Antibodies specifically binding a CHEPO polypeptide identified herein, as well as other molecules identified by 

the screening assays disclosed hereinbefore, can be administered for the treatment of various disorders in the form of 
pharmaceutical compositions. 

If the CHEPO polypeptide is mtraceUular and whole antibodies m used as irihibitors, intemaliraig amibodies are 
preferred. However, lipofections or liposomes can also be used to deliver the antibody, or an antibody fragment, into cells. 
1 0 Where antibody fragments are used, the smallest inhibitory frapent that specifically binds to the binding domain of the 
target proton is preferred. For example, based upon the variable-region sequences of an antibody, peptide molecules can 
be designed that retain the atittty to bind the target prote'm sequence. Such peptides can be synthesized chemically and/or 
produced by recombinant DNA technology. See, e.g., Marasco etat, Proc. Natl. Acad. Sci. USA. SQ^ 7889-7893 
(1 993). The formtdetion herein may also contain more than one active compound as necessary for the partindar imfication 
1 5 being treated, preferably those with complementary activities that do not adversely affect each other. Altemathrely, or 
in addition, the composition may comprise an agent that enhances its function, such as, for example, a cytotoxic agent, 
cytokine, chemotherapeutic agent, or growth-inhibitory agent. Such molecules are suitably present in combination in 
amounts that are effective for the purpose intended. 

The actwe ingredients may also be entrapped in microcapsules prepared, for example, by coacervation techniques 
20 or by interfadal polymerization, for example, hydroxymethylcellulose or gelatinnnicrocapsules and po!y-(methylmethacylate) 
ndcrocapsules, respectively, in colloidal drug delivery systiems (for example, liposomes, albumin microspheres, 
microemulsions, nano-particles, andnanocapsules) or in macroenudsidns. Such techniques are disclosed in Remington's 
Pharmaceutical Sciences, supra. 

The formulations to be used for m mo admuiistration must be sterile. This is readily accomplished by filtration 
25 through sterile filtration membranes. 

Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations include 
semqiermeable matrices of solid hydrophobic polymers containing the antibody, which matrices are in the form of shaped 
articles, a^v films, or microcapsules. Examples of sustained-release matrices include polyesters, hydrogeis (for example, 
poly(2-hydroxyetiiyl-metiiacrylate), or poly(vinylalcoho!)), polylactides (U.S. Pat. No. 3,773,919), copolymers of L-glutamic 
3 0 add and ertiyl-L«glutamate, non-degradable ethylene-vinyi acetate, degradable lactic add-glycolic add copolymers such 
as the LUPRON DEPOT ^ fajectabb mioospheres composed of lactic add-^Sc add copolymer and leuproSde acetatel 
and poly-D-(-)-34iydroxybutyric adi While polymers such as ethylene^ acetate and lactic actd-glycob add enabte 
release of mplecuhs for over 100 days, certain hyifarogels release proteins for shorter time periods. When encapsulated 
amibodies remain in tin body for a long time, they may denatora or aggregate as a 
3 5 resulting in a loss of biologfeal activity and possible changes in immunogenidty. Rational strategies can be devised for 
stabiTization depending on the mechanism mvohred. For exanqde, if tite aggregation mechanism is (Bscovered to be 
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intermolecular S S bond formation through thio-disulfide interchange, stabilization may be achieved by modifying sulfhydryl 
residues, tyophilizing from acidic solutions, controlling moisture content, using appropriate additives, and developing specific 
polymer matrix compositions. 

G. Uses for anti CHEPO Antibodies 

The anti-CHEPO antibodies of the invention have various utilities. For example, anti-CHEPO antibodies may be 
used in diagnostic assays for CHEPO, e,g., detecting its expression in specific cells, tissues, or serum. Various diagnostic 
assay technhiues known hi the art may be used, such as competitive bind'mg assays, direct or incfirect sandwich assays 
and immunoprecipitation assays conducted in either hetero^neous or homogeneous phases (Zola, Monoclonal Antibodies: 
AMaiualofTechninuBs. CRC Press, Inc. (1987) pp. 147-158]. The antibodies used *m the (fiagnostic assays can be labeled 
with a detectable moiety. The detectable moiety should be capable of producing, either directly or indirectly, a detectable 
signal For example, the detectable moiety may be a radioisotope, such as ^N, ^^C, ^P, or a fluorescent or 
chemiluminescent compound, such as fluorescein isothiocyanate, rhodamine, or lucif erin, or an enzyme, such as alkaGne 
phosphatase, beta-galactosidase or horseradish peroxidase. Any method known in the art for conjugating the antibody 
to tiie detectable moiety may be employed, including those metiiods desctftied by Hunter at al.. Nature, 144:945 (1982); 
David at aL, gidiemisiQC,^ 11874); Pain et aU J. Immunol Meth.. 40:219 (1981); and Nygren, J. Histochem. and 
Cvtochem.. 30:407 (19821. 

Anti CHEPO antibodies also are usef id for the affinity purification of CHEPO from recombinant cell culture or 
natural sources. In this process, the antibodies against CHEPO are immobilized on a suitable support, such a Sephadex 
resin or filter paper, using methods well known in the art The immobilized antibody then is contacted with a sample 
containing tiie CHEPO to be purified, and tiiereafter tiie support is washed with a suitable solvent tiiat will remove 
substantiaUy all tiie material in the sample except tiie CHEPO, which is bound to the immobilized antibody. Finally, the 
support IS washed with another suitable solvent that will release the CHEPO from the antibody. 

The following examples are offered for illustrative purposes only, and are not intended to Emit tiie scope of the 
present invention in any way. 

AO patent and literature references cited in the present specification are hereby Incorporated by reference in their 

entirety. 

EXAMPLES 

CommerdaDy avaBable leagoits referred to In the exanvbs were used acconfing to manufacturer's Instructions 
unless otherwise ondicatBd. Tha source of those cells identifU in the following exan^iles, and th^^ 
by ATCC accession numbers is die American Type Culture Collection, Manassas, VA. 
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EXAMPLE 1 
fsoiation of Nudeic Arid Encodino CHEPO 
Genomic ONA was isolated from two chimpanzee cell lines (ATCC CRL1609 and CRL1857) using a Qiagen kit 
(cat#10262] as recommended by the manufacturer's instructions. The chimp Epo gene was then obtained on 3 separate 
5 fragments by PGR using 1 g of genomic DNA as template and the following primer pairs: 
EPO.F: 5*-ACCGCGCCCCCTGGACAG-3' (SEQ ID N0:1 2) 
EP0.IWT1.R: 5'.CATCCACnCTCCGGCCAAACTTCA.3' (SEQ 10 N0:13) 

EP0JNT1F: S'TTTGGCCGGAGAAGTGGATGCT (SEQ 10 N0:14| 
10 EPgj!0ffi:5'-TCACTCACTCACTCAnCAnCAnCAnCA^ (SEQI0N0:15) 

EP0.»rr4F: 5*-GTTGAATGAATGATTGAATGAATGAGTGA.3* (SEQ ID N0:16) 
EPQ.R : S' GCACTGGAGTGTCCATG6GACAG-3' (SEQ ID N0:1 7) 

15 Each PGR reaction contained 5 1 of lOx PGR Buffer (Perkin Elmer), 1 1 dNTP (20mMl 1 g genomic DNA. 1 I of 

each primer, 1 1 of Taq polymerase (Clontech) and H2O to bring the total volume to 50 1. The reaction was first denatured 
for 4 min. at 94 C then amplified for 40 cycles of 1 nun. at 94 C J mia at 65 C or 66 C thai extended for 1 min. at 72 C. 
A last extension step of 5 min. at 72 C was perf ormei The reaction was then analyzed on agarose gel. PGR product of 
SOObp, 1200bp and 750bp were observed for each PGR product respecthrely. The PGR reactions were then pirified using 

20 a Wizard kit (Promega cat # A71 70) then directly sequenced. ONA sequencing of the PGR products was done using an 
Applied Biosystems 377 DNA Sequencer (PE/Applied Biosystems, Foster CHy, CA). The chembtry used was DYE 
Terminator Gycle Sequencing with dRhodamine and BIG DYE terminators (PE/Applied Biosystems, Foster City, CA). 
Sequence assembly and editing done with Sequencher software (Gene Codes, Ann Arbor, Ml). 

The 5 coding exons were identified by homology with the human erythropoietin sequence and assembled into a 

2 5 predicted fuO length cDNA. The coding region of CHEPO cONA is 579 nucleotides long (Figure 1 ) and encodes a predicted 

protein of 193 amino adds (Figure 3). There are 3 putative signal cleavage sites predicted after amino add residue 2Z 24 
and 27. In accordance with the N-terminus of human Epo, the latest one is likely to correspond to the deavage site for 
chimp Epo. The hydrophobic 27 amino acid signal peptide is foDowed by a 166 amino add long mature protein containing 
3 potential N-glycosylation sites. A single nucleotide polymorphism is pr^nt in the predicted sequence obtained from 

3 0 CRL1609 and changes the protein sequmce at anmio add position 142 from a Q to a K. AGgnment of tiia dump Epo protein 

with the human sequence indicates a single change at amino add position 84 (Figure 3). 

EXAMPLE2 
Use of CHEPO cDNA as a hybridization m nha 
35 The following method describes use of a mideetide sequence encoding CHEPO as a hybridization probe. 
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DNA comprising the coding sequence of full-length or mature CHEPO (as shown in Figure Z SEQ ID N0:3) is 
employed as a probe to saeen for homologous DNAs (such as those encoding naturally-occurring variants of CHEPO) in 
human tissue cDNA libraries or human tissue genomic libraries. 

Hybridization and washing of filters containing either library DNAs is performed under the following high 
5 stringency conditions. Hybridization of radiolabeled CHEPO-derived probe to the fitters is performed in a solution of 50% 
formamide, 5i SSC. 0.1% SDS, 0,1% sodium pyrophosphate, 50 mM sodium phosphate. pH 6.8, 2x Denhardt's solution, 
and 10% dextran sulfate at AVZ for 20 hours. Washing of the filters is perfomied in an aqueous solution of 0.1x SSC and 
0.1%SDSat42T. 

DNAs having a desired sequence identity with the DNA encoding fulNength native sequence CHEPO can then be 
1 0 identified using standard techniques Icnown in the art. 

EXAMPifS 
Expression of CHEPO in E toli 
This example illustrates preparation of an unglycosylated form of CHEPO by recombinant expression in £ coS. 
15 The DNA sequence encoding CHEPO (SEQ ID N0:3) is mitiaily amplified using selected PCR primers. The primers 

should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected expression vector. 
A variety of expression vectors may be employed. An example of a suitable vector is pBR322 (derived from £ raft see 
Bolivar et al., Gene, 2:95 (1 977)) which contains genes for amptdllin and tetracycline resistance. The vector is cfigested 
with restriction enzyme and dephosphoryiatei The PCR ampfified sequences are then ligated into the vector. The vector 
20 wiO preferably include sequences whh:h encode for ah antibiotic resistance gene, a trp promoter, a polyhis leader finchiding 
the first six STII codons, polyhis sequence, and enterokinase deav^e site), the CHEPO coding region, lambda 
transcriptional temiinator, and an argU gene. 

The Ggation mixture is then used to transform a selected £ colit\xm using the methods described in Sambrook 
et aL, Mil- Transf ormants are identified by their ability to grow on LB plates and antibiotic resistant colonies are tiien 

2 5 selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA sequencing. 

Selected dones can be grown overnight in Gqiad culture medium such as LB brotii supplemented widi antibiotics. 
The overnight culture may subsequently be used to inqcuiata a larger scaie cultore. The ceDs are tiien grown to a desired 
optical density, during which the expression promoter is tunted on. 

After ndturing the cells for several more hours, tiie cells can be harvested by centrif ugation. The cell peflet 

3 0 obtaiimd by tin centrifugation can be sdubiBzed using various agents known ta tiie aa end tfie sohibOized CHEPO protein 

can tiien be purified usmg a metal chelating cohmm under conditions timt allow tight binding of tiie protein. 

CHEPO may be expressed in £ eoS'wi a pofy-Ks tagged f qnn, using die f oDowing procedure. The DNA encocbig 
CHEPO is initialyanqdified using sriected PCR primers. The prinrnvvQI contain reimiction enzyme sites vvhichn 
to tiie restrfetnn enzyme sites on the selected expresskm vector, end otiier useful sequences providng for efficient and 
3 5 reBdble translation initiation, rapid purification on a metel chelation cohimn, and proteolytic removel widi enterokoiasa. 
The PCR-empSried, poly-His tagged sequences aie tiien Egated Into an expression vector, wMcb is osed to transform an . 
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£ wA^host based on strain 52 |W31 10 fuhAltonA) Ion galE rpoHts(htpRts) clpP(laclq). Transformants are first grown in 
LB containing 50 mglml carbanicillm at 30 C with shaking until an 0.0.600 of 3-5 is reached. Cultures are then diluted 
50-100 fold into CRAP media (prepared by mixing 3.57 g [UH^^^Q^ 0.71 g sodium ciuate 2H2a 1.07 g KCI, 5.36 g Difco 
yeast extract 5.36 g Sheffield hycase SF in 500 mL water, as well as 110 mM MPOS, pH 7.3, 0.55% (w/v) glucose and 
5 7 mM MgSOJ and grown for approximately 20-30 hours at 30 C with shaidng. Samples are removed to verify expression 
by SDS-PAGE analysis, and the bulk culture is centrif uged to pellet the cells. Cell pelieu are frozen until purification and 
refolding. 

£ cuff paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes <w/v) in 7 M guanidine, 
20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final concentrations of 0.1 M 

10 and 0.02 fM, respectively, and the solution is stirred overnight at 4 C. This step results In a denatured protein with all 
cysteine residues blocked by sulfltolization. The solution Is centrif uged at 40,000 rpm in a Beckman Ultracentlfuge for 30 
nun. The supernatant is diluted with 3-5 volumes of metal chelate column buffer (6 M guanidina, 20 mM Tris, pH 7.4) and 
filtered through 0.22 micron filters to clarify. The clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate 
column equilibrated in the metal chelate column buffer. The column is washed with additional buffer containing 50 mM 

15 imidazole (Caibiochem. Utrol grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions 
containing the desired protein are pooled and stored at 4 C. Protein concentration is estimated by its absorbance at 280 
nm using the calculated extinction coefficient based on its amino acid sequence. 

The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting of. 20 
mMTris,pH8.6, 0.3MNaCl2.SMurea,SmMcystein8i^20mMglycmeand1mMm^ Refolding volumes are chosen 

20 so that the final protein concentration is between 50 to 100 mlcrogramsfmL The refolding sohition b stirred gently at 4 C 
for 12-36 hours. The refolding reaction is quenched by the addition of TFA to a final concentration of 0.4% (pH of 
approximately 3). Before furtiier purification of the protein, the solution is filtered through a 0.22 micron filter and 
acetonitrile is added to 2-10% final concentration. The refolded protein is chromatographed on a Poros R1/H reversed 
phase column using a mobile buffer of 0.1% TFA with elution with a gradient of acetonitrile from 10 to 80%. Aliquou 

25 of fractions with A280 absortiance are enalyzed on SIS polyacrylamide gels and fractions containing homogeneous 
refolded protem are pooled. Generally, the property refolded spedes of most proteins ara eluted at the lowest 
concentrations of acetonitrile since those species are the most compact with their hydrophobic interiors shielded from 
interaction with the reversed phase resin. Aggregated species are usually ekrted at lugheracetonitnle concentrations. In 
addition to resolving misfolded forms of proteins from tiie desired f orni, tiie reversed phase step also removes endotoxin 

30 from tiie samples. 

Fractions containiiQ die desired folded CHEPO polypeptide are pooled and tiie acetonitrile ronoved ushq a gentie 
stream of nitrogen directed at tfie sohithm. Proteins ere formulated mto 20 mM Hqies, pH 6.8 with 0.14 M sodhmi 
chloride and 4% mamutol by dialysis or by gel filtration u»ng G25 Superf aie (Pharniacia) resins equilibrated in tiie 
formulation buffer and sterile filtered. 

35 
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EXAMPLE 4 
Expression of CHEPO in mammalian ceils 
This example illustrates preparation of a potentially glycosylated form of CHEPO by recombinant expression in 
mammalian cells. 

5 The vector. pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector. 

Optionally, the CHEPO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the CHEPO DNA 
using ligation methods such as described in Sambrook et al., supra. The resulting vector is called pRKS CHEPO. 

In one embodiment the selected host cells may be 293 ceDs. Human 293 cds (ATCC CCL 1S73) are grown to 
confhience in tissue cidture plates in murium such as DMEM supplemented with fetal calf serum and optionally, nutrient 

1 0 components andlor antibratics. About 1 0 g pRK5-CHEP0 DNA is mixed with about 1 g DNA encodmg the VA RNA gene 
rrh'mimappaya et aL, Ceg, 31:543 (1982)1 and iBssolved in 500 1 of 1 mM Tris-HCl, 0.1 mM EDTA, 0.227 M CaCI,. To 
this mixture is added, dropwise, 500 i of 50 mM HEPES (pH 7.35). 280 mM NaCI, 1.5 mM NaPOf, and a precifritate is 
allowed to form for 10 minutes at 25*C. The precipitate is suspended and added to the 293 cells and allowed to settle 
for about four hotffs at 37*C. The culture medium is aspirated off and 2 ml of 20% glycerol In PBS Is added for 30 

1 5 seconds. The 293 cells are then washed with senmi free medium, fresh medium is added and the cells are incubated for 
about 5 days. 

Approximately 24 hours after the transf ections, the culture medium b removed and r^aced with culture mednni 
(alone) or culture medium containing 200 Ci/nd ^S*cysteine and 200 Ci/ml ^S-methionine. After a 12 hour incubation, 
the conditioned medium is collected, concentrated on a sp'ui f ilter, and loaded onto a 15% SDS gel The processed gel may 
20 be dried and exposed to fOm for a selected period of time to reveal the presence of CHEPO polypeptide. The cultures 
containing transf ected cells may undergo further mcubation fm serum free medium) and tiie medium is tested m selected 
faioassays. 

In an alternative technique, CHEPO may be introduced into 293 cells transiently using tiie dextran sulfate method 
described by Somparyrac et al. Proc Natl. Acad. Sci.. 12:7575 (1981). 293 cells are grown to maxnnal density in a 

25 spinner flask and 700 g pRKS-CHEPO DNA is added. The ceHs are first concentrated from the spinner flask by 
centrifugation and washed witii PBS. The DNA-dextran precipitate is bcubated on die cell pellet for four hours. The cells 
are treated wifli 20% glycerol for 90 secondSr washed with tissue culture medium, and reModuced into the spinner flask 
containii^i tissue culture meium, 5 gfml bovine oisulin and 0.1 gjml bovine transferrin. After about four days, the 
conditioned media is centrifuged and filtered to remove cells and debris. The sample contahiing expressed CHEPO can tfien 

30 be concentrated and purified by any selected method, such as ifialysts and/or cohann chromatography. 

In anottier enibodiment CHEPO can be expressed in CKO ceh. The pRKS-CHff 0 can be transfected bito CHO 
ceUs using known reagents such as CaP04 or DEAE-dextran. As descrBied abovOr the cd cultures can be tnodiatBi and 
the medhim replaced vvidi cidture mnlium (abne) or mediian contanilhg a radhilabel such as ^^Smthionme. After 
detenninmg tin presence rf CHEPO polypeptRie,tiieGidturBm^^ Preferably, 

35 tin cuhum are mcubated for about 8 days, and tiientimconditk^ The medium contuntag the 

expressed CHEPO can tiien be concentrated and purified by my selectiBd mediod. 
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EpHopHagged CHEPO may also be expressed in host CHO cells. The CHEPO may be subdoned out of the pRKS 
vector. The subclone insert can undergo PGR to fuse In frame with a selected epitope tag such as a poly-his tag into a 
Baculovirus expression vector. The poly-his tagged CHEPO insert can then be subdoned into a SV40 driven vector 
containing a selection marker such as DHFR for selection of stable dones. Finally, the CHO cells can be transf ected (as 
5 described above) with the SV40 driven vector. Labeling may be performed, as described above, to verify expression. The 
culture medium containing the expressed poly-His tagged CHEPO can then be concentrated and purified by any selected 
method, such as by NP'-chebte afruAy chromatography. 

CHEPO may also be expressed in CHO andfor COS cells by a transient expression procedure or in CHO cells by 
another stable expression procedure. 
1 0 Stable expression in CHO celb is performed using the following procedure. The proteins are expressed as an IgG 

construct (rnimunoadhesin), in which the coding sequences for the soluble forms (e.g. extracellular domains) of the 
respecthre proteins are fused to an IgGI constant region sequence containing the hinge, CH2 and CH2 domains and/or is 
a poly-His tagged form. 

Following PCR amplificatioa the respective ONAs are subdoned in a CHO expression vector using standard 
15 techniques as describol in Ausubel et al.. Current Protocols of Molecular Bioioov. Unit 3.18, John Wiley and Sons (1997). 
CHO expression vectors are constructed to have compatible restriction sites 5 and 3 of the ONA of interest to allow the 
convenient shuttling of cDNA s. The vector used expression in CHO cells is as described in Lucas et al.. Mud. Acids Res. 
24:9 (1774-1779 (1996), and uses the SV40 early promoter/enhancer to drive expression of the cDNA of interest and 
dihydrof olate reductase (DHFR). DHFR expression permits selection for stable maintenance of the plasmid following 
20 transf action. 

Twelve micrograms of the deared plasmid ONA is introduced into approximately 10 million CHO cells using 
commerdally available transfection reagents Superfect (Quiagen), Dosper or Fugene (Boehringer Mannheim). The cells 
are grown as described in Lucas et aL, sygra, Approximately 3 x 10 ' ceOs are frozen in an ampule for further growth and 
production as described below. 

25 Tte ampules containing the plasmid ONA are thawed by placement into water bath and nuxed by vortexing. The 

contents are pipetted mio a centrifuge tidie contdnbig 10 mLs of media and centrifuged at 1 000 rpm for 5 minutes. The 
supernatant is aspirated and the ceOs are resuspended in 10 mL of selective media (02 m filtered PS20 with 5% 0.2 m 
diafiitered fetal bovine serum). The cells are then aGquoted into a 1 00 mL spfamer contammg 80 mL of selective media 
After 1*2 day^ the ceOs are transferred bito a 250 mL spinner filled widi 150 mL selective growdi medium and incubated 

30 at37%. After anotiier 2-3 d8y$, 250 mU SOD mL and 2000 mL simmers ere seeded TheceDmeifia 
is exdiangadwidi fresh media by cemrifugationa^ Attiiough any suitable CHO mecSa 

may be employed, a production meAmi described m U.S. Patent No. 5,122,469, issued ^ 16, 1992 may actually be 
used. A 3L production spinnv is seeded at 1.2 xlOFcelhlM.^0^ Onday1,the 
simnv is san^U and spaipq with filtered air is commas On day 2, tin spmner Is sampled, the tenqwrature shifted 

35 to 33T, and 30 mL of 500 gfL ghicbse and 0.B mL of 10% antifoam (e.g., 35% polydimetiiylsiloxane emulsion, Oow 
Corning 365 Medical Grade Emulsion) taken. Ttooughont tiie production, the pH is adjusted as necessary to keep it at 
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around 7.1 After 10 days, or until the viability dropped below 70%, the cell culture is harvested by centrifugation and 
filtering through a 0.22 m filter. The filtrate was either stored at 4*C or immediately loaded onto columns for purification. 

For the poty His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before purification, 
imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media is pumped onto a 6 ml Ni- 
5 NTA coiunnn equilibrated in 20 mM Hepes, pH 7.4. buffer containing 0.3 M NaCl and 5 mM imidazole at a flow rate of 4-5 
ml/min. at 4*'C. After loading, the column is washed with additional equilibration buffer and the protein eluted with 
equlGbration buffer containing 0.25 M imidazole. The highly purified protein is std)sequently desalted into a storage buffer 
containing 10 mM Hopes, 0.14 M NaCI and 4% mannitol, pH 6.8, with a 25 ml 625 Superfina (Pharmacia) column and 
stored at -BOX. 

1 0 Immunoadhesin (Fc containmg) constructs are purified from the conditioned media as follows. The conditioned 

medhmi is pumped onto a 5 ml Protein A column (Pharmacia) winch had been equilibrated in 20 mM Na phosphate buffer, 
pH 6.8. After loading, the cohimn is washed extensnrety with equilibration buffer before elution with 1 00 mM citric acid, 
pH 3.5. The eluted protein is immediately neutrafized by collecting 1 ml fractions into tubes containing 275 L of 1 M Tris 
buffer, pH 9. The Mghly purified protein is subsequently desalted into storage buffer as described above for the poly-His 

1 5 tagged proteins. The homogeneity is assessed by SOS polyacrylamide gels and by N-terminal amino acid sequencing by 
Edman degradation. 

EXAMPLES 
Expression of CHEPD in Yeast 
20 The foOowing method describes recombinant expression of CHEPO in yeast. 

Hrst yeast expression vectors are constructed for intracellular production or secretion of CHEPO from the 
A0H2/GAPDH promoter. DNA encoding CHEPO and the promoter is inserted into suitable restriction enzyme sites in the 
selected plasmid to direct intracellular expresnon of CHEPO. For secretion/ONA encoding CHEPO can be cloned into the 
selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native CHEPO signal peptide or other 

2 5 mammalian signal peptide, or, for example, a yeast alpha-factor or invertase secretory signal/leader sequence, and linker 

sequences Of needed) for expression of CHEPO. 

Yeast ceSs, such as yeast stram AB1 10, can then be transformed with the expression plasmids described above 
, and cultured n sdected feimentation meia. the transfonned yeast sopematants can be enalyzed by precipitation with 
10% trichloroacetic add and separation by SDS4^A6E, foOowed by staining of the gels with Coomassie Bhie stabu 

3 0 Recomboiant CHEPO csi subsequently be isolated and purified by removing the yeast ceBs from the fermentation 

medum by centrif ugatum and then cimcentratmg the medium vang sdected cartridp fatera. The concentrate containing 
CHEPO may further be purified using selected cohimn chromatography resins 

EXAMPLE6 

35 Expression of CHEPO in Baculovirus-infected Insect Cafls 

Iba fonowiiq method descr^ recomUnant expression of CHEPO in Bacuiovinis-aifected insect cells. 
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The sequence coding for CHEPO is fused upstream of an epitope tag contained witfiin a baculovirus expression 
vector. Such epitope tags include poiy-his tags and iiraminoglobufin tags (liice Fc r^ions of IgGK A variety of plasmids may 
be employed, including plasmids derived from commercially available plasmids such as pVL1393 (Novagen). Briefly, the 
sequence encoding CHEPO or the desired portion of the coding sequence of CHEPO such as the sequence encoding the 
5 extraceOular domain of a transmembrane protein or the sequence encoding the mature protein if the protein is extracellular 
is amprrfied by PCR with primers complementary to the 5* and 3' regions. Tbe 5' prener may incorporate flanking (selected) 
restriction enzyme sites. The product is then digested with those selected restriction enzymes and subcloned into the 
expression vector. 

Recombinant baculovirus is generated by co-transfecting the above plasmid and BacuioGold^ virus ONA 

10 (PharmingenI into Spodt^t&a frug^&da fSfS") cells (ATCC CRL 1711) using lipofectin (commercially available from 
GIBCO-BRL). After 4 • 5 days of incubation at 28%, the released viruses are harvested and used for further amplifications. 
Viral infection and protein expression ar« performed as described by CReHley et al.. Baculovirus exDression vectors: A 
Laboratory Manual. Oxford: Oxford University Press (1 994). 

Expressed poty-his tagged CHEPO can then be purified, for example, by NP^-chelate affinity chromatography as 

1 5 follows. Extracts are preparol from recombinant virus-infected Sf 9 cells as descr&ed by Rupert et at, Nature. 362 :1 75- 
179 (1993). Briefly. Sf9 ceUs are washed, resuspended in sonlcation buffer (25 ml Hopes, pH 7.9; 12.5 mM MgCl,; 0.1 
mM EDTA; 10% glycerol- 0.1% NP40; a4 M KCt), and sonicated twice for 20 secomb on ice. The sonicates are cleared 
by centrifugation, and the supernatant is diluted 50-foU in loacfing buffer (50 mM phosphate, 300 mM NaCI, 10% glycerol 
pH 7.8) and filtered through a 0.45 m fftv. A Iti'^-NTA agarose column (commerdally avaidila from Qiagen) is prepared 

20 with a bed volume of 5 mU washed with 25 mL of water and mpdibrated with 25 ml of loading buffer. Ttie filtered ceB 
extract is loaded onto the column at 0.5 ml per minute. The column is washed to baseSne Azn with loading buffer, at 
which point fraction collection is started. Next the column is washed with a secondary wasfi buffer (SO mM phosphate; 
300 mM NaCl 10% glycerol pH 6.0), whidi elutes nonspecifically bound protein. After reaching Ajgobaseline again, the 
cohmm is developed with a 0 to 500 mM Imidazole gradient in the secondary wash buffer. One ml fractions are collected 

25 and analyzed by SOS-PAGE and silver staining or Western blot with Ni^'-NTA-coi^gated to alkaGne phosphatase (Qiagen). 
Fractions conta'ming the ehited HiSiQ-tagged CHEPO are pooled and dialyzed agairst loading buffer. 

Alternatively, purification of tiie ^G tqged (or Fc tagged) CHEPO can be pofomied using loiown dranatography 
techniques, inchiding for instance. Protein A or protein 6 cohmin chromatography* 

30 EXAMPLE 7 

Prenaration of Antibodies tfiat Bind CHEPO 
TMs exaiqile ihistrates preparation of monoclonal antibodies which can specifically bind CHEPO. 
Tedinniues for producing tlie monoclonal antibodies m known in tin art and are descnbed, for mstance, in 
Coding, syfiQ. Immunogens tiiat may be employed include purifiad CHEPO, fusion proteins containmg CHEPO, and cells 
35 expressing lecombnant CHEPO on tiie cell surface. Sdection of tiuimmunogen can be made by tiiesUkd artisan widiout 
undue experimentation. 
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Mice, such as Balb/c. are immunized with the CHEPO immunogen emulsified m complete Freund's adjuvant and 
injected subcutaneously or intraperitoneally in an amount from MOO micrograms. Alternatively, the immunogen is 
emulsified in MPL-TDM adjuvant (Rib! Immunochemical Research. Hamilton, MI) and injected into the animal's hind foot 
pads. The immunized mice are then boosted 10 to 12 days later with additional immunogen emulsified m the selected 
5 adjuvant. Thereafter, for several weeks, the mice may also be boosted with additional hnmunization injections. Serum 
samples may be periodically obtained from the mice by retro-orbital bleeding for testing in EUSA assays to detect anti- 
CHEPOantaiodies. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected with a 
final intravenous injection of CHEPO, Three to four days later, the mice are sacrificed and the spleen cells are harvested. 
10 The spleen cells are then fused (using 35% polyethylene glycol) to a. selected murine myeloma cell line such as 
P3X63AgU.1, avaHable from ATCC, No. CRL 1597. The fusions generate hybridoma cells which can then be plated in 96 
waB tissue culture plates containing HAT {hypoxanthlne, aminopterin, and thymidine) medium to witvbit proliferation of non- 
fused cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an EUSA for reactwity against CHEPO. Determination of "positive" 
15 hybridoma cells secreting the desired monoclonal antibodies against CHEPO is within the skill in the art 

The posithre hybridoma ceils can ba mjected intraperitoneally into syngeneic Balb/c mice to produce ascites 
containing the anti-CHEPO monodonai antibodies. Altemativdy, the hybridoma calls can be grown ui tissue culture flasks 
or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be accomplished using ammonium 
sulfate precipitation, followed by gel exchision chromatography. Altemathrely, affinity chromatography based upon biniBng 
20 of antibody to protebi A or protemG can ba employed. 

EXAMPLES 

Purification of CHEPO Polvnentides Usino Specific Antihodies 
Natwe or recombinant CHEPO polypeptides may be purified by a variety of standard techniques in the art of 

2 5 protein purification. For exsnple, pro CHEPO polypeptide, mature CHEPO polypeptide, or pre^JHEPO polypeptide is purified 

by onmunoaffmity chromatography using antibodies specific for the CHEPO polyp^tida of interest In genml an 
onmunoaffinity column is constructed by covalently coupling tha anti-CHEPO polypeptide antibody to an activated 
chromatographic resin. 

Polycbnal immunoglobulins are prqiared from immune sera ether by precipitation vrith ammonium sulfate or by 

3 0 purification on immobiBzed Protein A (Pharniada LKB Biotechnology, Piscataway, N J,). LQcewise, monoclonal antibodies 

are prepared from mouse ascites f lud by ammomum sulfate precipitetion or chromatography on immobiTaed Protein A. 
Partially purified hununoglobulln is covalently attached to a chromatographic resin such as CiiBr-activated SEPHAROSE^ 
(Phannada LKB Biotechnology). The antibody is coupled to the rasbi, the resin is blocked, and tha derivative resin is 
washed acconbig to the manufacturer's bistnictions. 
35 Sudi an nnmunoaffinity cohm b utilized in the purification of CHEPO polypeptide by preparing a fraction from 

ceils contahung CHEPO polypeptide in a sohibla form. This preparation is derived by sohjUBzation of tha whole ceO or of 
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a subcellular fraction obtained ^a differential centrif ugation by the addition of detergent or by other methods well known 
in the art Alternathrely« soluble CHEPO polypeptide containing a signal sequence may be secreted in useful quantity mto 
the medium in which the ceils are grown. 

A soluble CH£PO polypeptide-containing preparation is passed over the immunoaffimty column, and the column 
5 is washed under conditions that allow the preferential absorfaance of CHEPO polypeptide {fi.g„ high ionic strength buffers 
in the presence of detergent). Then, the cohimn is eluted under conditions that disrupt antibodyfCHEPG polypeptide binding 
(ff.^., a low pH buffer such as approximately pH 2-3, or a high concentration of a chaotrope such as urea or thiocyanate 
ion), and CHEPO polypeptide is collected. 

10 EXAMPLES 

DruQ Screening 

This invention is particularly useful for screening compounds by u»ng CHEPO polypeptides or binding fragment 
thereof in any of a variety of (bug screening techniques. The CHEPO polypeptide or fragment employed in such a test may 
either be free in sohition, affixed to a sojid support borne on a cell surface, or located intracelNarly. One method of drug 

15 screening utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant nucleic adds 
expressing the CHEPO polypeptide or fragment Drugs are screened aga'uist such transformed cells in competitive binding 
assays. Such cells, eth» in viable or fixed form, can be used for standard binding assays. One may measure, for examplar 
the fonnation of complexes between CHEPO polypeptide or a f repent and the agent being tested. Alternatively, one can 
examina the dinunution in complex formation between the CHff 0 polypeptide and its target cell or target receptors caused 

20 by the agent bdng teste! 

ThuSr the present invention provides methods of screening for drags or any other agents which can affect a 
CHEPO polypeptide-essociated disease or disorder. These nnthods comprise contacting such an agent with an CHEPO 
polypeptide or fragment tiiereof end essaying (I) for the presence of a complex between the agent and the CHEPO 
polypeptide or fragment or (ii) for the presence of a complex between the CHEPO polypeptide or fragment and the cell, 

25 by methods well known m the art In such competithre binding essays, the CHEPO polypeptide or fragment is typicelly 
labeled. After suitable mcubation, free CHEPO polyimptide or fragment is separated from tiiat present in bound form, end 
the amount of free or uncomplexed labd is a measure of tiie ability of the particular agent to bind to CHEPO polypeptide 
or to interfere witi) tiie CHEPO polypeptidefceU complex. 

Another technique for drug screening provides high throughput screenmg for compounds having suitable btncGng 

30 affiraty to a polypeptide end is (bscribedb detail m Briefly stated, large 

numbers of different smaQ peptUe test compounds are syntfiasized on a solid substra^ 
surface. As applied to a CHEPO polypeptUe, tiie peptide test compounds are reacted with CHEPO polypeptide and washed. 
Bound CHBPO polypqitide b detected by metiiods weD known h the art Purified CHEPO polypeptide can also be coated 
ffirectly onto plates for use in tiie aforementioned drug screering techniques. In addition, non^ieutralizing antibodies can 

35 be used to cepture the peptide and immobilize it on the solid support 
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This invention also contempiates the use of competitive drug screening assays in which neutralizing antibodies 
capable of binding CHEPQ polypeptide specificaily compete with a test compound for binding to CHEPO polypeptide or 
fragments thereof. In this manner, the antibodies can be used to detect the presence of any peptide which shares one or 
more antigenic determinants with CHEPO polypeptide. 

5 

EXAMPLE 10 

Rational Drug Design 

The goal of rational ihig des'p is to produce stmctural analogs of biologically active polypeptide of interest (la, 
a CHEPO polypeptide) or of small molecules with which they interact, e.g., agonists, antegonists, or inhibitors. Any of 

1 0 these examples can be used to fashion drugs which are more active or stable f ornis of the CHEPO polypeptide or which 
enhance or interfere with the function of the CHEPO polypeptide m mo {p.K Hodgson, Bioffechnoloov. ft 19-21 (1991)). 

In one approach, the three^imensional structure of the CHEPO polypeptide, or of an CHEPO pdypeptide-inhaiitor 
complex, is determined by x*ray crystallography, by computer modeBng or, most typically, by a combinathin of the two 
approaches. Both the shape and charges of tiie CHEPO polypeptide must be ascertained to elucidate the structure end 

15 to determine active site(s) of the molecule. Less often, usefid mf oimation regarding the structure of the CHEPO polypeptide 
may be gamed by modeling based on the structure of homologous proteins. In both cases, relevant structural mf ormation 
is used to desqn analogous CHEPO polypeptide^ke molecules or to iibntify effidoit inhibitors. Usefid examples of rational 
drug design may include molecules which have improved activity or stability as shown by Braxton and Wells, Biochematrv. 
31:77967801 (1992) or which act as inhibitors, agonists, or antagonists of native peptides as shown by Athauda std., 

2 0 IBiochent. 113:742746 (1993). 

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, and tiien 
to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent drug design can be 
based. It is possible to bypass protein crystallography altogether by generating anti-idiotypic antibodies (anti-ids) to a 
functional pharmacologically active antibody. As a mirror image of a mirror image, tiie bind'mg site of the anti-ids would 

25 be expected to be an analog of tha original receptor. The anti-id could tiien be used to identify and isolate peptides from 
banks of chemica&y or biotogicaily produced peptides. The isolated peptides would tiien act as tiie phamfiacore. 

By virtue of the present invention, sufficient amounts of tiie CHEPO polypeptide may be made available to 
perform such analyticel studies as X-ray crystaOography. In addition, knowledge of tiie CHEPO polypeptide amino acid 
sequence provided herein wiD provide guidance to tiuise employbg computer modefing technniues in place of or in adiBtion 

30 to x-ray ctystaHograpby. 

The f oregobig vvritten spedfication b consnhred to be suffidem to enable 
invention as dabned. Various modificathms of the invemion in addition to tiiose shown end de^ 
apparent to tiiose sUled in tiie ert from tiie foregoing description and f aO whiun tiu scope of the appended claims. 
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WHAT IS CLAIMED tS: 

1 . An isolated mideic acid mQlecde having at least about 80% nucleic acid sequence Identity to (a) a DNA 
molecule encotfing a CHEPO polypeptide comprising the sequence of amino add residues from about 1 or about 28 to about 
193 of Figure 2 (SEQ ID N0:2L or (b) the complement of the DNA molecule of (a). 

2. The isolated nucleic acid molecule of Claim 1 comprising nucleotides 1 or about 82 to about 579 of 
Figure 2 (SEQ ID N0:3). 



3. The isolated nudeic acid molecule of Claim 1 comprising the nudeotide sequence of F^ure 2 (SEQ ID 

10 N0:3). 

4. The isolated nucleic add molecule of Clann 1 comprising a nucleotide sequence that encodes the 
sequence of amino add residues from about 1 or about 28 to about 193 of Figure 2 (SEQ ID N0:2). 

15 5. An isolated nudeic add moloujle encoding a CHEPO polypeptkle comprising DNA that hybridizes to the 

complement of the nudeic acid sequence that encodes amino adds 1 or about 28 to about 193 of Figure 2 (SEQ ID N0:2). 

8. The isdated nucleic add molecule of Claim 5, wherein the nudeic add that encodes amino adds 1 or 
about 28 to about 1 93 of Hgure 2 (SEQ ID N0:2) comprises nudeotides 1 or about 82 to about S79 of Figure 2 (SEQ ID 

20 N0:3). 

7. The isolated nudeic add molecule of Claim 5, wherein the hybridization occurs under stringent 
hybridization and wash conditions. 

25 8. An isolated nucleic add molecuie comprising (a) DNA encoding a polypeptide scoring at least 80% 

posithres when convared to the sequence of ammo add reddues of to^ 
N0:2), or (b) the conqriement of the DNA of (a). 

9. A vector comprising the nudnc add mdecub of Claim 1. 

30 

10. The vector of CIdm 9, wherein said micleic add molecub Is operably Imked to control sequences 
recognized by a host cell transfoimed with the vector. 

11. A host cell comprising the vector of Chum 9. 

35 

1Z The host cell of Clabn 11, wherein nidceO is a CHO cell 
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13. The host cell of Claim 1 1, wherein said cell is an £ coS. 

14. The host cell of Claim 1 1 , wherein said cell is a yeast cell. 

5 1 5. A process for producing a CHEPO polypeptide comprising culturing the host cell of Claim 1 1 under 

conditions suitable for expression of said CHEPO polypeptide and recovering said CHEPO polypeptide from the cell culture. 

16. An isolated CHEPO polypeptide comprising an amino acid sequence comprising at least about 80% 
sequence identity to the sequence of anano acid residues from about 1 or about 28 to about 193 of Figure 2 (SEQ ID N0:2). 

10 

17. The isolated CHEPO polypeptide of Claim 16 comprising amino acid residues 1 or about 28 to about 
193ofRgure2(SEQIONO:2L 

18. An isolated CHEPO polypeptide scoring at least 80% positives when compared to the sequence of 
15 amino add residues from 1 or about 28 to about 193 of Hgwe 2 (SEQ ID N0:2). 

19. An isolated CHEPO polypeptide comprising the sequence of amino acid residues from 1 or about 28 to 
about 193 of Hgure 2 (SEQ ID N0:2), or a fragment thereof sufficient to provide a binding site for an antHIHEPO antSuidy. 

20 20. AnbolatedpolypeptideproducedbyWhybriiizingatestDM 

(a) a DMA molecule encod'mg a CHEPO polypeptide comprising the sequence of anano add residues from 1 or about 28 to 
about 193 of Rgure 2 (SEQ ID N0:2L or (b) the complement of the ONA molecule of (a), (ii) culturing a host cell comprising 
said test DMA molecule umier conditions suitable for the expression of said polypeptide, and (iii) recovering said polypeptide 
from the cell culture. 



25 
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21. The isolated polypeptide of Claim 20, wherein saM test ONA has at least about 80% sequence ideiitity 
to (a) or (b). 

22. A chimeric molecule comprising a CHEPO polypeptide fused to a heterologous amino add sequence. 

23. The chimeric molecuto of Claim 22, wherein said heterologous amino add sequence is an epitope tag 



24. The diimeric molecuto of Claim 22, wherem said heterologous ammo add sequence b a Fc region of 
35 an nnmonoglobuBn. 
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An antibody which specificaiiy binds to a CHEPO polypeptide. 
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26. The antibody of Claim 24, wherein said antibody is a monoclonal antibody. 

27. The antbody of Claim 24. wherein said antibody is a humanized antibody. 

28. An agonist to a CHEPO polypeptide. 

29. An antagonist to a CHEPO polypeptide. 

30. A composition of matter comprising (a) a CHEPO polypeptide, (b) an agonist to a CHEPO polypeptide, 
(c) an antagonist to a CHEPO polypeptide, or (d) an anti-CHEPO antibody in admixture with a phannaceuticalty acc^itabie 
carrier. 

31. A method of inducing erythropoiesis in a mammal, said method comprismg administering to s»d 
mammal an effective amount of a CHEPO polypeptide or an agonist thereto, wherein erythropoiesis in said mammal is 
induced. 

32. A method of inhibiting erythropoiesis in a mammal, said method comprising administering to said 
mammal an effecthe amount of an antagonist to a CHEPO polypeptide, wherein erythriqioiesis in said mammal is inhibited. 

33. The method according to Claim 31, wherein said antagonist is an anti-CHEPO antibody. 

34. A CHEPO polypeptide comprising an amino acid sequence selected from the group consisting of: 

(1) APPRUCDSRVifRYlLEAKEAENinGCAEHCSLNENITVPOTKVNFYAWKRNXSXOQAVEW 
KSSQPWEPUIUWDKAVSGLRSLTTLIIWLGAQKEAISP^^ 

6DR(SEai0N0:18); 

(2) APPRUCDSRVLERYlUAKEAENlTTGCABICSUiENm^ 
WSOPWEPIOLHVDKAVSBIHSITTURAIGAKKEAISPPOAASW 
60R|SEQIDN0:19); 

(3) APPRUCDSRVLERYUIAkEAENITTGCABICSLNENtTV^^ 
MSSOPWEPLOLHVDKAVSBmSLTTUHALfiAQKEAISPPDAASAAPL 
6DR(SEQIDN020k. 

W APPRUCOSRVIERYIIEAKEAEMITTGCAEHCSINENITVPDTKVNFYAWICRN^ 
IKSaPWEPLQLHVDKAVSGLRSLTTURALGAKKEAB 
GDR(SEQIDN0:21k 
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(5) APPRUCDSRVlERYUEAKEAENimi»»HCSLNEIIIITVPOTKVNFYAm^ 
VNSSQPWEPlQUWDKAVSGLRSLTTLUULMQKEAiSPPOAASAAPLRmAOT^ 
T60R(SEai0N0:22]; 

(6) AmUCDSRVLERYlliAKEAENrrTGCAEHCSUVENITVPDTKVNFYAWKRMNXSXQAVEWQGlAIJjEA^ 
VNSSQPWEPLQIMDKAVSGLRSLTTLUIALGAKKEAtSPPOAASAAPLRmAOTFRKLFRmNllRSI^ 
T60R(SEQION0:23|; 

(7) APPRUCDSRVIiRYLLEAKEAENinGCAEHCSLNENITVPOTKVNFYAWKRMNXTXaAVEVWQGLAU^U 
VIVSaiPWEPLOLHVDKAVSGLRSLTTLLRALGAQKEAISPPOAASAAPLRmA^ 
T6DR(SEQION0:24|; 

(8) APPRUCOSRVLERYUEAKEAENinGCAEHCSUtfEimVPDTIWIinnrAWiCRMNXimV^ 
VNSSQPWEPLQUIVOKAVSeUISLTTURALGAKKEAISPPOAASAAPUmADTffli^^ 
TG0R(SEQI0N0:2St; 

(9) APPRUCOSRVLBIYUEAKEAENinGCAEHCSLNENITVPimCVNFYAWKRMENXSXA 
VNSSQPWEPLbLHVDKAVSGUSLTTURALGAQKEAISPPOAASAAPLRmAOTra 
TGDR(SEQI0N0:26); 

(10) APPRUCOSRVLERYUiAKEAENITTGCAEHCSUtlENirVPDTKVNFYAWKRMENXSXAVEmOGlALLSEA^ 
VNSSQPWEPLQUlVOKAVSGLASLTTURALGAKKEAISPPOAASAAPLRmADTFRiaHmSNFU 
TG0R(SEQIDN0:27}; 

(1 II APPRUCDSRVURmEAKEAENITTGCAEHCSLNENnVPOTKVNFYAWKRMENXmVEVm 

VNSSQPWEPLQLHVDKAVSGUSLTTURAUAQKEAISPPOAASAAPLRmAOTra 

TGDRCSEQIDNOm- 
(121 AninjCIlSRVLERYUEAI(EAENin6CAEH(:SLNENITVPDTKV^ 

VNSSIlPllimilUtVDKAVSGLRSLTTIllUimKEAISPPOAASAAPU^ 

T6DR(SEQIDN0:29t; 
(131 APPRUCOSRVLBIYUEAKEAENITTGCAEHCSLNEIIIITVPinX^ 

NSSQPVmQLHWKAVSGlRSLTTURALGAaKEAISPPOAASAAPL^ 

6OR|SEQI0N0:30|: 
(Ml AinUCDSRVLBtYUEAKEAENITTGIMEHCSUeilTVPDII^ 

NSSIlPWEPimHVOKAVSGUSLTTLLnUJSAiaCEAiSPPDAA 

60R(SEQIONO:31); 

(151 APPRUCDSRVIiRYllEAKEAENinGCAEHCSUliENITVPOTKVNFYAWKRMEVNX^ 
NSSQPVVEPLQlHVDKAVSGUiSLTriUUU^AIlKEAiSPPDAASAAPUll^^ 
6DR(SE0J0N0:32);and 

(161 APPRUCDSRVlEHYUlAKEAENtTTGCAEHCSLNENITVPDTlCVNITAWKRMEra^ 

NSSQPVimQUIVDKAVSGUtSLTTLLRALGAKKEAISPPOAASAAPLRTITMTFRiai^ 
6DR (SEQ n N(I:33K wtnrain X is any amim) acid exceiK f or proliM. 
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35. TliaCHEropolypaptideacnnfingtoaaim34,wliichcain|ifB8sm 
thegmopconsitlingof: 

(1) MGVHECPAmWULSUSIAGLPVLiGAPPRUCDSinrURmEAKEAENITTGCAEHCSW 
SXQQAVEVWQGlALLSEAVLRGQAUVNSSOPVVEPLQlJWDKAVSGLRSLmLIU^ 
TFRKIFRVYSNFLRGKLKLYTGEACRTGOR (SEQ ID N0:34); 

(2) MGVHECPAWLWUlSLLSLIlGIJnrLGAPPRUCOSRVLERYLLEAKEAENinGCAEHCSLNENnWDm^ 
SXQOAVEmOGI^EAVIJIGQAUVNSSQPWEPlQUWOKAVSGLRSLTTIlRAimi^ 
TFRKLFRVYSNFLRGKLKLYTGEACRTGOR (SEQ 10 N0:35); 

(3) MGVHECPAmVtniLSLLSLPlGLPVLGAPPRUCDSRVLERYLLEAKEAENITTGCAEHCSLNENIWPDT^^ 
TXQOAVEVWQGULLSEAVUiGaALLVNSSQPWEPLQmVDKAVSGLRSlTTUJULGAQKEA^^ 
TFRKIFRVYSNFLRGKIKLYTGEACRTGOR SEQ ID HOm, 

(4) MGVHECPAmWUXSUSLPlGLPVUiAPPRUCOSRVlERyiLEAKEAENITTG^ 
TXimVEVWQGUUSEAVUtGQAamSOPWEPimHVOKAVSGUeiTTLlRAL^^ 
TFRKlFRWSNFlRGiaKLYTGEACRTGOR SEQ iO N0:37l- 

(5) MGVHECPAWLWLUmSLPLGLPVI^PPRUCOSRVlERYLlfAKEAENITTGCAEH 
XSXQAVEVWQGUUSEAVLRGQAUVNSSOPWEPLQLHVOKAVSGLRSLTTLLRALGAQXEAIS^ 
TFRKLFRVYSNnRGKLKLYTGEACRTGDR (SEQ 10 N0:38); 

m M6MCPAmWLLLSU£U>l£imGAPPRUC0SRVlERYUEAKEAENITTGC^^ 

XSXQAVEVWQGLAlLSEAVUlGQAUVNSSQPWEPimHVDKAVSGLRSLTTLLRAimK^ 

TraKLFRVYSNFLRGKUCLYTGEACRTGOR (SEQ 10 N0:3g); 
(7) MGVHECPAmWUlSU^IKGLPVlGAPPRUCDSRVLERYLLEAKEAENinGCAEHCSWENnVPD^ 

XTXQAVEVWQGLAUSEAVLRGQAUVNSSQPVI^PLQUWDKAVSGUISLTTLUMLGAQ^^ 

TFRnnimNHRGKlJCLYTGEACRTGDR (SEQ 10 NO?IO); 
(81 MGVHECPAmWUmLSlPLGLPVLGAPPRUCDSRVlERYUEAKEAENITTC 

mOAVEVWQGLAUSEAVUGOAUmSOPWEPLOUtVDKAVS^ 

1FRiafRmNnRGimYTGEACRT6DR (SEQ 10 NIMI); 

(9) M6VICf:PAWLWtUSUSU1£LPVL6APPRUCDSRViaYUIAKEAENITTGCABIC^ 
NXSXAVEVWQGUUSEAVlRGOAUVNSSOPWEPUIUIVDKAVSeUSin^^ 
TmiajmySNRRGKIJanGEACRTGOR (SEQ 10 

(10) MGVHECPAWLWIlSUSLPtGLPVUMPPRUCOSRVlERYLLEAl^ 
NXSXAVEVWQ6UU£EAVUI6QAUVNSSQPWEPLQLHVDXAVS6LRSLT1URALGAin(^ 
TFRKLFRVVSNFLRGKUaYTGEACRTGOR (SEQ ID N0:43); 

(11) MGVHECPAmWUlSUSIJ'LGLPVLGAPPflUCDSRVlERYUEAKEAENfnGCAEHCSWENiTVPOTONmW^ 
NXTXAVEVWQGUU5EAVIJIGQAUVNSS0PWEPIQIJW0KAVS6UBLTTIUWIGA0KEAISPP0 
TFRKLFRVYSNFLRGKLKLYTGEACRTGOR (SEQ ID N0:44}; 



96 



wo 00/68376 



PCTAJSOQ/12370 



(12) MOmEGPAm.WllUUStJULinn£APPRLIC0SIMBiYUEAi(^ 
NXTMVEVWQSL/UlSEAVlJmQALLmSSaPWEPlflLHVOKA^^ 
TFRKLFRVYSNFLRGXLKLYTBEACRTGOR (SEQ 10 NIMSk 

(13) M6VHECPAWLWlLLSllSU16UnfL6APraUC0SIIVLEinrU» 
VNXSXVEVWQ6UUSEAVUGQAaVNSSQPWERQU1VDKAVSGLRSLTTUiUaGAQI^ 
TFRKLFRVYSNFLRGKLKLYTGEACRTGDR (SEQ 10 m.m 

(14) MGVHECPAWLWUlSLLSLPL&imGAPraUCDSRVLERYLlEAKEAENITTGCAEHCSLNEM^ 
VNmVEVWQGUOLSEAVUlGQAUVNSSllPWLaLHVDXAVSeUiaTTUJiALGAKK^ 
TFRIOPRVYSNFLRGKLKLnGEACRTGDR (SEQ ID N0:47): 

(15) MGVHECPAMWLLLSIJ^m£U>VL6APPRUCDSRVURYUiAKEAEMTTGCAEHCSLNENITV^^ 
VIIXTXVEVmiAlLSEAVLRG(]AUVNSS(lPWEPLQLHVDI^^^ 
TFIUaHtVySNFlRGKUaYT6EA(»TG0R (SEQ ID N0:48); and 

(16) MGVHE(»>AWLWULSLlSLPL6LPyUAPPflUCDSiWLEimJiAi^^ 
VNXmEVWQGlAU^EAVLRGQAaVNSSQPVVEPUII^ 

TFRKIHIVYSNFLRGKLKLYT6EACRTGDR (SEQ 10 mM], wiranh X is any amino achi excapt for pratne. 

36. A cMmerie moheula comprising a CHEPO poiypeptida of (aaim 34 or 35 fused to a iwtaraiagoiit amin 
aekiseipienn. 

37. Tlndmnrieniolacu!edCiaini36,wii8ninsaidlBterologoiisanm 
saquenea. 

38. Tin diimeric molecuia of Claim 36, wharain said hatenlagoiis anina add sequence is a Fc regbn of 
animnumogioindin. 
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FIGURE 3 

human MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICDSRVLERYLLEAKEAE 
************************************************** 

chepo MGVHECPAWLWLLLSLLSLPIXniPVLGAPPRLICDSRVLERYLLEAKE;^ 
human NITTGCl^CSLNENIWPDTKVNFYAWKRMEVGQQAVEWQ^ 

★ ★★****«*4r***«***«^««*****ib«4^***** **************** 

chepo NITTGO^CSLNENIWPDTKVNFYAWKRMEVRQQAVEWQGLALLS^ 



human VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 
***************************************** ******** 

Chepo VLRGQALLVNSSQPWEPl^LHVDKAVSGLRSLTTLLRAI/^ 



human AASAAPLRTITADTFRKLFRVYSNFLRGKLKLYTGEACRTGDR 
******************************************* 

Chepo AASAAPLRTITADTFRKLFRVYSNFLRGKLKLYTGEACRTGDR 
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Continuation of Box 1.2 
Claims Nos.: 28, 30(b). 31(b) 



Renark (2): Present claims 28-32 relate to an extremely large number of 
possible CHEPO agonists and antagonists. Support within the meaning of 
Article 6 POT and/or disclosure within the meaning of Article 5 PCT Is to 
be found, however, for only a very small proportion of the antagonists 
claimed. In the present case, the claims so lack support, and the 
application so lacks disclosure, that a meaningful search over the whole 
of the claimed scope is Impossible. Consequently, the search has been 
carried out for those parts of the claims which appear to be supported 
and disclosed, namely those parts relating to antl-CHEPO antibodies, as 
claimed In claims 30 and 33. Claim 28, and those parts of claims 30 and 
31 where reference Is made to agonists to a CHEPO polypeptide, have not 
been searched. 

The applicant's attention Is drawn to the fact that claims, or parts of 
claims, relating to Inventions in respect of which no International 
search report has been established need not be the subject of an 
International preliminary examination (Rule 66.1(e) PCT). The applicant 
Is advised that the EPO policy when acting as an International 
Preliminary Examining Authority Is normally not to carry out a 
preliminary examination on matter which has not been searched. This Is 
the case Irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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